Methods and systems of video processing

ABSTRACT

A method of processing a video includes capturing a first set of video data at a first definition, transmitting the first set of video data at a second definition lower than the first definition wirelessly to a user terminal, receiving a video edit request wirelessly from the user terminal, and finding video corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition. The video edit request is formed from editing the received first set of video data at the second definition at the user terminal.

CROSS-REFERENCE

This application is a continuation application of application Ser. No. 15/598,170, filed on May 17, 2017, which is a continuation application of International Application No. PCT/CN2015/075458, filed on Mar. 31, 2015, which claims priority to International Application No. PCT/CN2014/093785, filed on Dec. 14, 2014 and International Application No. PCT/CN2014/093786, filed on Dec. 14, 2014; is a continuation-in-part of International Application No. PCT/CN2014/093785, filed on Dec. 14, 2014; and is a continuation-in-part of International Application No. PCT/CN2014/093786, filed on Dec. 14, 2014, the contents of all of which are hereby incorporated by reference in their entirety.

BACKGROUND

Aerial vehicles, such as unmanned aerial vehicles (UAVs), have been developed for a wide range of applications including surveillance, search and rescue operations, exploration, and other fields. Such UAVs can often carry a camera module onboard for video capturing.

The video captured by UAVs can be displayed, edited and distributed. However, current video processing methods and systems for UAVs can be less than ideal. For example, excessive bandwidth may be consumed in downloading the captured video from a UAV, especially high definition video.

SUMMARY

A need exists for improved methods and systems for editing images, such as video, captured by image capture devices. In some instances, the image capture devices may be supported by or integral to movable objects, such as unmanned vehicles. Methods may be provided for editing video captured by image capture devices, such as image capture devices on unmanned aerial vehicles (UAVs). The editing may occur on a user terminal, such as a handheld device. In some embodiments, the UAV may have an image capture device that captures video at a first definition and transmits the captured video to the handheld device at a second definition. The handheld device may receive and edit the captured video at the second definition and form a video edit request. The video edit request may be small in size and contain edited video data information for an edited video. The video edit request may then be transmitted to the UAV and/or image capture device. The UAV and/or image capture device may analyze the edited video file, generate a video file corresponding to the edited video data information, and transmit the generated video file to the handheld device or another device.

With the methods, devices and terminals of the present disclosure, the user terminal may download the video at a lower definition from the image capture device before editing, rather than the high definition video. The user may edit the video at the user terminal, transmit a small video edit request to the image capture device, and receive a high definition video corresponding to the edited video from the image capture device. Therefore, the pressure on bandwidth for transmission and on computation for video editing may be alleviated, since not all of the high definition video is transmitted.

According to an aspect of the present disclosure, a method of processing a video is provided. The method comprises: receiving a first set of video data at a user terminal, wherein the first set of video data is provided from an image capture device remote to the user terminal and configured to capture a video at a first definition, wherein the first set of video data at the user terminal is received at a second definition; editing the received first set of video data at the second definition, thereby forming a video edit request; transmitting the video edit request to the image capture device; and receiving, from the image capture device, a second set of video data at a third definition, wherein the second set of video data corresponds to edited video data in the video edit request.

According to another aspect of the present disclosure, a user terminal for processing a video is provided. The user terminal comprises a communication unit that permits communication with a remote image capture device, wherein the image capture device is configured to capture a video at a first definition; and one or more processors that individually or collectively: receive a first set of video data, wherein the first set of video data is provided from the image capture device, wherein the first set of video data is received at a second definition; edit the received first set of video data at the second definition, thereby forming a video edit request; generate a signal for the communication unit to transmit the video edit request to the image capture device; and receive a second set of video data from the image capture device at a third definition, wherein the second set of video data corresponds to edited video data in the video edit request.

According to another aspect of the present disclosure, a computer program product for processing a video, the computer program product being encoded on non-transitory machine-readable storage media, is provided. The computer program product comprises: instructions for receiving a first set of video data at a user terminal, wherein the first set of video data is provided from an image capture device remote to the user terminal and configured to capture a video at a first definition, wherein the first set of video data at the user terminal is received at a second definition; instructions for editing the received first set of video data at the second definition, thereby forming a video edit request; instructions for transmitting the video edit request to the image capture device; and instructions for receiving, from the image capture device, a second set of video data at a third definition, wherein the second set of video data corresponds to edited video data in the video edit request.

According to another aspect of the present disclosure, a method of processing a video is provided. The method comprises capturing, using an image capture device, a video at a first definition, and storing the video in memory; transmitting a first set of video data derived from the video to a user terminal remote to the image capture device at a second definition; receiving, at the image capture device, a video edit request from the user terminal, wherein the video edit request is formed from editing the received first set of video data at the second definition at the user terminal; finding, in the memory, video corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition; and transmitting, to the user terminal, the second set of video data at the third definition.

According to another aspect of the present disclosure, an image capture device for processing a video is provided. The image capture device comprises an image sensor configured to capture a video at a first definition; and one or more processors that individually or collectively: generate a signal for a communication unit to transmit a first set of video data derived from the video at a second definition; receive a video edit request generated at a user terminal, wherein the video edit request is formed from editing the received first set of video data at the second definition at the user terminal; find, in a memory, video corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition; and generate a signal for the communication unit to transmit, to the user terminal, the second set of video data at the third definition.

According to another aspect of the present disclosure, a processor is provided. The processor is configured to: receive a video edit request from a user terminal, wherein the video edit request is formed from editing a received first set of video data at a second definition at the user terminal, and wherein the first set of video data is provided from an image capture device remote to the user terminal and configured to capture a video at a first definition; find, in a memory, video corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition, wherein the memory is configured to store the video at the first definition; and generate a signal for a communication unit to transmit the second set of video data at the third definition to the user terminal.

According to another aspect of the present disclosure, a method of processing a video is provided. The method comprises capturing, at an image capture device, a video at a first definition, and storing the captured video in a memory; providing, from the image capture device, a first set of video data derived from the captured video to a user terminal, wherein the first set of video data is provided at a second definition; editing, at the user terminal, the first set of video data at the second definition, thereby forming a video edit request; providing, from the user terminal, the video edit request to the image capture device; finding, at the image capture device, video in the memory corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition; and providing, from the image capture device, the second set of video data to the user terminal at the third definition.

According to another aspect of the present disclosure, a system for processing a video is provided. The system comprises an image capture device; and a user terminal in communication with the image capture device, wherein the image capture device comprises: an image sensor configured to capture a video at a first definition; a memory configured to store the video at the first definition; and one or more processors that individually or collectively: generate a signal to provide a first set of video data derived from the captured video to the user terminal, wherein the first set of video data is provided at a second definition; receive a video edit request generated at the user terminal, wherein the video edit request is formed from editing the first set of video data at the second definition at the user terminal; find, in the memory, video corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition; and generate a signal to provide, to the user terminal, the second set of video data at the third definition; and wherein the user terminal comprises: one or more processors that individually or collectively: receive the first set of video data at the second definition; edit the received first set of video data at the second definition, thereby forming the video edit request; generate a signal to provide the video edit request to the image capture device; and receive the second set of video data from the image capture device at the third definition.

According to another aspect of the present disclosure, a method of processing a video is provided. The method comprises determining, at a user terminal, whether a video captured by an image capture device remote to the user terminal is to be processed; selecting, at the user terminal, one or more video clips to be merged with the video captured by the image capture device when the video is determined to be processed, thereby forming a video edit request; and transmitting the video edit request to the image capture device to cause the image capture device to process the video by merging the one or more video clips with the video.

According to another aspect of the present disclosure, a user terminal for processing a video is provided. The user terminal comprises a communication unit that permits communication with a remote image capture device, wherein the image capture device is configured to capture a video; and one or more processors that individually or collectively: determine whether a video captured by the image capture device is to be processed; select one or more video clips to be merged with the video captured by the image capture device when the video is determined to be processed, thereby forming a video edit request; and generate a signal for the communication unit to transmit the video edit request to the image capture device to cause the image capture device to process the video by merging the one or more video clips with the video.

According to another aspect of the present disclosure, a computer program product for processing a video captured by an image capture device, the computer program product being encoded on non-transitory machine-readable storage media, is provided. The computer program product comprises instructions for determining whether the video captured by the image capture device is to be processed; instructions for selecting one or more video clips to be merged with the video captured by the image capture device when the video is determined to be processed, thereby forming a video edit request; and instructions for generating a signal for a communication unit to transmit the video edit request to the image capture device to cause the image capture device to process the video by merging the one or more video clips with the video.

According to another aspect of the present disclosure, a non-transitory computer readable medium containing program instructions for processing a video captured by an image capture device is provided. The non-transitory computer readable medium comprises program instructions for determining whether the video captured by the image capture device is to be processed; program instructions for selecting one or more video clips to be merged with the video captured by the image capture device when the video is determined to be processed, thereby forming a video edit request; and program instructions for generating a signal for a communication unit to transmit the video edit request to the image capture device to cause the image capture device to process the video by merging the one or more video clips with the video.

According to another aspect of the present disclosure, a method of processing a video is provided. The method comprises capturing a video at an image capture device; receiving, at the image capture device, a video edit request from a user terminal remote to the image capture device, wherein the video edit request is formed from a selection of one or more video clips to be merged with the video when a determination is made at the user terminal that the video is to be processed; and merging the one or more video clips with the video captured at the image capture device based on the video edit request.

According to another aspect of the present disclosure, an image capture device for processing a video is provided. The image capture device comprises an image sensor configured to capture a video; a communication unit that permits communication with a user terminal remote to the image capture device; and one or more processors that individually or collectively: generate a signal for the image sensor to capture a video; receive a video edit request from the user terminal, wherein the video edit request is formed from a selection of one or more video clips to be merged with the video when a determination is made at the user terminal that the video is to be processed; and merge the one or more video clips with the video captured at the image sensor based on the video edit request.

According to another aspect of the present disclosure, a processor is provided. The processor is configured to receive a video edit request from a user terminal remote to the processor, wherein the video edit request is formed from a selection of one or more video clips to be merged with the video when a determination is made at the user terminal that the video is to be processed; and merge the one or more video clips with the video captured at an image capture device based on the video edit request.

According to another aspect of the present disclosure, a method of processing a video is provided. The method comprises capturing a video at an image capture device; determining, at a user terminal, whether the video captured by the image capture device is to be processed; selecting, at the user terminal, one or more video clips to be merged with the video captured by the image capture device when the video is determined to be processed, thereby forming a video edit request; providing, from the user terminal, the video edit request to the image capture device; and merging, at the image capture device, the one or more video clips with the video captured by the image capture device.

In another aspect of the present disclosure, a system for processing a video is provided. The system comprises an image capture device configured to capture a video; and a user terminal that is in communication with the image capture device, wherein the user terminal comprises: one or more processors that individually or collectively: determine whether the video captured by the image capture device is to be processed; select one or more video clips to be merged with the video captured by the image capture device when the video is determined to be processed, thereby forming a video edit request; and generate a signal to provide the video edit request to the image capture device; and wherein the image capture device comprises: an image sensor configured to capture the video; and one or more processors that individually or collectively: receive the video edit request from the user terminal; generate a signal for the image sensor to capture a video; and merge the one or more video clips with the video captured at the image sensor.

It shall be understood that different aspects of the disclosure can be appreciated individually, collectively, or in combination with each other. Various aspects of the disclosure described herein may be applied to any of the particular applications set forth below or for any other types of movable objects. Any description herein of aerial vehicles, such as unmanned aerial vehicles, may apply to and be used for any movable object, such as any vehicle. Additionally, the systems, devices, and methods disclosed herein in the context of aerial motion (e.g., flight) may also be applied in the context of other types of motion, such as movement on the ground or on water, underwater motion, or motion in space.

Other objects and features of the present disclosure will become apparent by a review of the specification, claims, and appended figures.

INCORPORATION BY REFERENCE

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.

BRIEF DESCRIPTION OF THE DRAWINGS

The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present disclosure will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the disclosure are utilized, and the accompanying drawings of which:

FIG. 1 shows a schematic of an unmanned aerial vehicle (UAV) that is used to capture video, and a user terminal, in accordance with an embodiment of the disclosure.

FIG. 2 is a flow chart illustrating a method of processing a video on a user terminal in accordance with an embodiment of the disclosure.

FIG. 3 is a flow chart illustrating a method of processing a video on an image capture device in accordance with an embodiment of the disclosure.

FIG. 4 is a flow chart illustrating a method of processing a video in an image processing system in accordance with an embodiment of the disclosure.

FIG. 5 illustrates an exemplary flow of image-related data among image capturing components in an image processing system, in accordance with an embodiment of the disclosure.

FIG. 6 illustrates an exemplary flow of image-related data further among image editing components in an image processing system, in accordance with an embodiment of the disclosure.

FIG. 7 illustrates an exemplary editing file, such as a video edit request, that may be generated at a user terminal of an image processing system, in accordance with an embodiment of the disclosure.

FIG. 8 illustrates an exemplary flow of edited image-related data further among image generating components in an image processing system, in accordance with an embodiment of the disclosure.

FIG. 9 is a flow chart illustrating a method of processing a video at a user terminal, in accordance with an embodiment of the disclosure.

FIG. 10 is an example of content in a video edit request as formed at a user terminal, in accordance with an embodiment of the disclosure.

FIG. 11 is a flow chart illustrating a method of preparing video clips for a prologue, a transition effect and an epilogue at a user terminal, in accordance with an embodiment of the disclosure.

FIG. 12 is a flow chart illustrating a method of merging a prologue, a transition effect and an epilogue at an image capture device, in accordance with an embodiment of the disclosure.

FIG. 13 is a flow chart illustrating a method of merging a prologue, a transition effect and an epilogue at an image capture device, in accordance with an embodiment of the disclosure.

FIG. 14 is a schematic illustration by way of block diagram of a system for processing a video, in accordance with an embodiment of the disclosure.

FIG. 15 is a flow chart illustrating a method of processing a video, in accordance with an embodiment of the disclosure.

FIG. 16 is a diagram illustrating a merged video produced by a method of processing a video, in accordance with an embodiment of the disclosure.

FIG. 17 is a flowchart showing a video processing method according to an embodiment of the present disclosure.

FIG. 18 is a flowchart showing a video processing method according to another embodiment of the present disclosure.

FIG. 19 is a flowchart showing a video processing method according to still another embodiment of the present disclosure.

FIG. 20 is a schematic view showing a configuration of a video processing device according to an embodiment of the present disclosure.

FIG. 21 is a schematic view showing a configuration of a video processing device according to another embodiment of the present disclosure.

FIG. 22 is a schematic view showing a configuration of a receiving module in the video processing device according to the embodiment shown in FIG. 20 or FIG. 21.

FIG. 23 is a schematic view showing a configuration of a recording module in the video processing device according to the embodiment shown in FIG. 20 or FIG. 21.

FIG. 24 is a schematic view showing a configuration of an editing module in the video processing device according to the embodiment shown in FIG. 21.

FIG. 25 is a schematic view showing a configuration of a synthesizing module in the video processing device according to the embodiment shown in FIG. 21.

FIG. 26 is a schematic view showing a configuration of a processing module in the video processing device according to the embodiment shown in FIG. 21.

FIG. 27 is a schematic view showing a configuration of a playing device according to an embodiment of the present disclosure.

FIG. 28 is a flowchart showing a video processing method according to an embodiment of the present disclosure.

FIG. 29 is a flowchart showing a video processing method according to another embodiment of the present disclosure.

FIG. 30 is a schematic view showing a configuration of a video processing device according to an embodiment of the present disclosure.

FIG. 31 is a schematic view showing a configuration of a video processing device according to another embodiment of the present disclosure.

FIG. 32 is a schematic view showing a configuration of an acquiring module in the video processing device according to the embodiment shown in FIG. 30 or FIG. 31.

FIG. 33 is a schematic view showing a configuration of an editing module in the video processing device according to the embodiment shown in FIG. 30 or FIG. 31.

FIG. 34 is a schematic view showing a configuration of a collecting module in the video processing device according to the embodiment shown in FIG. 31.

FIG. 35 is a schematic view showing a configuration of a screening module in the video processing device according to the embodiment shown in FIG. 31.

FIG. 36 is a schematic view showing a configuration of a displaying device according to an embodiment of the present disclosure.

FIG. 37 shows a computer control system that is programmed or otherwise configured to implement methods provided herein.

DETAILED DESCRIPTION

The methods, devices and terminals described herein provide an effective approach for efficiently transmitting and editing video captured by an image capture device, such as an image capture device on-board or integral to a movable object such as an unmanned aerial vehicle (UAV). The methods, devices and terminals described herein can be used to capture video at high definition, transmit and edit video at a lower definition, and obtain and distribute an edited video at high definition. The methods, devices and terminals disclosed herein can be applied to any suitable movable object or stationary object. A movable object may be capable of self-propelled movement (e.g., a vehicle), while a stationary object may not be capable of self-propelled movement. In some embodiments, the movable object may be a UAV.

Methods may be provided for editing video captured by image capture devices, such as image capture devices on unmanned aerial vehicles (UAVs). The editing may occur on a user terminal, such as a handheld device. In some embodiments, the UAV may have an image capture device that captures video at a first definition and transmits the captured video to the handheld device at a second definition. The handheld device may receive and edit the captured video at the second definition and form a video edit request. The video edit request may be small in size and contain edited video data information for an edited video. The video edit request may then be transmitted to the UAV and/or image capture device. The UAV and/or image capture device may analyze the edited video file, generate a video file corresponding to the edited video data information, and transmit the generated video file to the handheld device or another device.

Image capture devices may be used for capturing video. In some embodiments, the image capture device may capture video at a first definition (e.g., high definition), store the captured video in a memory and transmit the captured video to a user terminal at a second definition that is lower than the first definition. The image capture device may then receive a video edit request from the user terminal. The video edit request may contain edited video data information on an edited video. The image capture device may generate a video file at the first definition corresponding to the edited video data information, and transmit the generated video file to the user terminal. In some embodiments, the image capture device may be supported by or may be part of a UAV. Any description herein of an image capture device of a UAV may apply to other image capture devices that are supported by or part of other movable objects, or non-movable objects. Additionally, any description herein of an image capture device of a UAV may apply to any type of image capture device, such as high-definition image capture devices.

User terminals may be used for editing video. In some embodiments, the user terminal may receive video at a second definition from an image capture device which captures video at a first definition higher than the second definition. The user terminal may edit the received video, form a video edit request and transmit the video edit request to the image capture device. The video edit request may be small in size and contain edited video data information on the edited video. The user terminal may then receive from the image capture device a video file which is generated at the first definition corresponding to the edited video data information.
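
The overall exchange described above can be illustrated with a short, self-contained Python sketch. Everything in it, including the class and method names, the example definitions ("4K" and "720p"), and the dictionary-based request, is an illustrative assumption rather than the disclosed implementation; it only shows that the video itself flows once in each definition, while the edit travels as a small request.

```python
# A minimal, self-contained sketch of the message flow described above.
# All names, definitions, and data structures here are illustrative
# assumptions, not the disclosed implementation.

HIGH_DEF, LOW_DEF = "4K", "720p"

class ImageCaptureDevice:
    def __init__(self):
        self.stored = {}  # clip_id -> (definition, placeholder for frames)

    def capture(self, clip_id):
        # Video is captured and stored at the first (high) definition.
        self.stored[clip_id] = (HIGH_DEF, f"{clip_id}-frames")

    def preview(self):
        # First set of video data: the same clips, provided at a lower definition.
        return {clip_id: LOW_DEF for clip_id in self.stored}

    def fulfill(self, edit_request):
        # Second set of video data: only the requested clips, at high definition.
        return [(clip_id, HIGH_DEF) for clip_id in edit_request["clips"]
                if clip_id in self.stored]

class UserTerminal:
    def edit(self, preview):
        # The user keeps a subset of clips; only this small request is sent back.
        kept = sorted(preview)[:2]
        return {"version": 1, "clips": kept}

device, terminal = ImageCaptureDevice(), UserTerminal()
for clip_id in ("clip_001", "clip_002", "clip_003"):
    device.capture(clip_id)

edit_request = terminal.edit(device.preview())  # editing uses the low-definition copy
second_set = device.fulfill(edit_request)       # device returns the high-definition result
print(edit_request, second_set)
```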

The user terminal may download the video at a lower definition from the image capture device, rather than the high definition video. A user at the user terminal may edit the video at the user terminal, transmit a video edit request to the image capture device, and receive a high definition video corresponding to the edited video from the image capture device. Therefore, the pressure on bandwidth for transmission and computation for video editing may be alleviated, since not all the high definition video is transmitted. Systems and methods provided herein may advantageously permit editing of video that was captured at a high definition with reduced amounts of transmission delay and required processing power.

FIG. 1 shows a schematic of an unmanned aerial vehicle (UAV) 100 and a user terminal 110 in accordance with an embodiment of the disclosure. Any description herein of a UAV 100 may apply to any type of movable object, such as an aerial vehicle. The description of a UAV may apply to any type of unmanned movable object (e.g., which may traverse the air, land, water, or space). The UAV may support an image capture device or may have an image capture device integrally formed thereon that is used to capture video.

The UAV 100 may have a body 120. In some instances, the body 120 may be a central body which may have one or more branching members, or “arms.” The arms may extend outward from the body in a radial manner and be joined via the body. In some instances, the arms may form T-bars. In some instances, the arms may or may not be movable relative to the rest of the body. The number of arms may or may not match the number of propulsion units, or rotors, of the UAV. For example, the UAV may be a quadcopter having four propulsion units, each supported by a separate arm of the UAV. In another example, a UAV may be a quadcopter having four propulsion units, where two propulsion units are supported on the ends of a T-shaped arm of the UAV. The body may comprise a housing. The housing may enclose one or more components of the UAV within the housing. In some instances, one or more electrical components of the UAV may be provided within the housing. For example, a flight controller of the UAV may be provided within the housing. The flight controller may control operation of one or more propulsion units 160 of the UAV. The propulsion units 160 may each include the rotors and/or motors.

The UAV 100 may carry a variety of sensors onboard, including but not limited to laser sensor, lidar, infrared sensor, inertial measurement unit (IMU), global positioning system (GPS) sensor, audio collecting sensor, and optical sensor. The sensors and propulsion units may be powered by a battery assembly. The battery assembly may include one or more batteries, which may be distributed within a housing of the UAV.

The UAV 100 may include an image sensor 122, a memory 124, a processor 126, and a communication system 128. Additionally, UAV 100 may include an image capture device 180. The image capture device may be integrally formed as part of the UAV or may be a separable component supported by the UAV. The image sensor 122 may be a component of image capture device 180, or image sensor 122 may interact with the image capture device 180 such that image sensor 122 receives image data that is captured by the image capture device 180 of UAV 100. Similarly, the memory 124, processor 126, and/or communication system 128 may be a part of the image capture device 180 or may be part of the UAV outside the image capture device that may interact with the image capture device 180 or the image sensor 122.

Additionally, FIG. 1 illustrates user terminal 110. The user terminal 110 may be any type of device having data processing and communication capabilities. In some embodiments, the user terminal 110 may be a mobile device, such as a smartphone (e.g., iPhone, Android phone, Windows mobile phone, Blackberry, etc.) or tablet (e.g., iPad, Android tablet, Surface tablet). The user terminal may be any other type of device, such as a laptop computer, desktop computer, or wearable object. The user terminal 110 may be able to communicate with an image capture device 180 that is remote to the user terminal 110. The user terminal may communicate directly with the image capture device or a UAV 100 that may support the image capture device. In some instances, the UAV and the image capture device may be one and the same. Any description herein of communication with an image capture device may refer to communication with the image capture device without communication with a UAV, communication with a UAV that supports or is integral to the image capture device, or communication with both an image capture device and the UAV. The user terminal may communicate with the image capture device over a wireless link, including but not limited to, a radio frequency (RF) link, a Wi-Fi link, a Bluetooth link, a 2G link, a 3G link, a 4G link, or an LTE link. The communication may occur directly between the user terminal and the image capture device, or may traverse a network, such as a telecommunications network.

The user terminal 110 may include a communication system 116, memory 117, and processor 118. The communication system 116 of the user terminal may be able to interact with a communication system 128 of the UAV 100 via wireless signals 112. The communication systems 128, 116 may include any number of transmitters, receivers, and/or transceivers suitable for wireless communication. The communication may be one-way communication, such that data can be transmitted in only one direction. For example, one-way communication may involve only the UAV 100 transmitting data to the user terminal 110, or vice-versa.

For example, one-way communication may include image data, such as video, as well as additional information such as video edit requests. For example, video data may be transmitted from one or more transmitters of the communication system 128 to one or more receivers of the communication system 116. The video data may then be stored in memory 117 and edited using processor 118. Once a video edit request is generated at user terminal 110, the video edit request may be transmitted from one or more transmitters of the communication system 116 to one or more receivers of the communication system 128.

Alternatively, the communication may be two-way communication, such that data can be transmitted in both directions between the UAV 100 and the user terminal 110. The two-way communication can involve transmitting data from one or more transmitters of the communication system 128 to one or more receivers of the communication system 116, and vice-versa. The two-way communication may include video data from the image capture device and/or UAV to the user terminal, and one or more video edit requests from the user terminal to the image capture device and/or UAV. In some instances, communications may include one or more command signals from the user terminal to the image capture device and/or UAV. For instance, a command signal may be used to control operation and/or flight of the UAV. Thus, the user terminal that may be used for video editing may also be used to control operation and/or flight of the UAV. In another example, the command signal may be used to control operation of one or more sensors or other components on-board the UAV. Thus, the user terminal that may be used for video editing may be used to control collection of data of the UAV. The command signal may be used to control operation and/or positioning of the image capture device relative to the UAV. Thus, the user terminal that may be used for video editing may be used to control operation and/or positioning of the image capture device. The same device may be used to control any or all of these functions. Alternatively, one or more different devices may be used to control these various functions.

In some embodiments, the image capture device 180 may be a camera. The camera 180 may be rigidly coupled to the UAV 100. Alternatively, the camera 180 may be permitted to move relative to the UAV 100. In some instances, the camera may be permitted to move with respect to up to six degrees of freedom relative to the UAV. The camera 180 may be directly mounted onto the UAV 100, may be incorporated within the UAV 100, or may be coupled to a carrier mounted onto the UAV 100. In some embodiments, the carrier may be a gimbal. The carrier may permit the camera to rotate about one, two, three, four or more axes of rotation. This may permit the field of view captured by the camera to be altered. In some embodiments, the camera 180 may be an element of a payload of the UAV 100.

The camera 180 may capture images of an environment of the UAV 100. The images may include still images (e.g., snapshots) and/or video images. The camera 180 may continuously capture video and/or snapshots. Alternatively, the camera 180 may capture video and/or snapshots at a specified frequency to produce a series of image data and/or video data over time. In some embodiments, the video may be captured simultaneously with a recording of environmental audio.

In some embodiments, the captured video and/or images may be stored in a memory onboard the UAV 100, such as memory 124. The memory 124 may be on-board the image capture device 180 or may be separate from the image capture device. The memory may include a non-transitory computer readable medium that may include one or more memory units (e.g., removable media or external storage such as a Secure Digital (SD) card, a random access memory (RAM), a read only memory (ROM), or a flash memory). Alternatively, the captured video and/or images may be transmitted to a remote terminal. The transmission of captured video and/or images may be implemented over a wireless link, including but not limited to, a radio frequency (RF) link, a Wi-Fi link, a Bluetooth link, a 2G link, a 3G link, or an LTE link.

The camera 180 may comprise an image sensor 122 and one or more optical elements, such as lenses. The one or more lenses may be configured to direct light to the image sensor. An image sensor is a device that converts an optical image into an electronic signal. The image sensor of the camera 180 may be a charge-coupled device (CCD) type, a complementary metal-oxide-semiconductor (CMOS) type, an N-type metal-oxide-semiconductor (NMOS) type, or a back-side illuminated CMOS (BSI-CMOS) type. Optionally, image data from the image sensor may be stored in memory 124 on-board or off-board the image capture device. The memory 124 may be on-board a UAV.

A focal length of an optical system is a measure of how strongly the system converges or diverges light. The focal length of an optical lens is the distance over which initially collimated rays are brought to a focus. There are two types of lenses: prime and zoom. A prime lens may have a fixed focal length and the focal length may encompass a single focal length. A zoom lens may have variable focal lengths and the focal length may encompass a plurality of focal lengths. In some embodiments, the lens of the camera 180 may be a zoom lens. Alternatively, the lens of the camera 180 may be a prime lens.

In some embodiments, a lower limit of the focal length range for a zoom lens suitable for the image capture device of the present disclosure may be less than or equal to 1 mm, 2 mm, 4 mm, 6 mm, 8 mm, 10 mm, 15 mm, 20 mm, 25 mm, 30 mm, 35 mm, 40 mm, 45 mm, 50 mm, 55 mm, 60 mm, 65 mm, 70 mm, 75 mm, 80 mm, 85 mm, 90 mm, 95 mm, 100 mm, 105 mm, 110 mm, 120 mm, 130 mm, 140 mm, 150 mm, 160 mm, 170 mm, 180 mm, 190 mm, 200 mm, 220 mm, 240 mm, 260 mm, 280 mm, 300 mm, 320 mm, 340 mm, 360 mm, 380 mm, 400 mm, 450 mm, 500 mm, 550 mm, 600 mm, 650 mm, 700 mm, 750 mm, 800 mm, 850 mm, 900 mm, 950 mm, 1000 mm, or 1150 mm. Optionally, the lower limit of the focal length range for the image capture device of the present disclosure may be less than or equal to any of the values described herein. The lower limit of the focal length range for a zoom lens suitable for the image capture device of the present disclosure may have a value falling within a range between any two of the values described herein.

In some instances, an upper limit of the focal length range for a zoom lens suitable for the image capture device of the present disclosure may be less than or equal to 2 mm, 5 mm, 10 mm, 15 mm, 20 mm, 25 mm, 30 mm, 35 mm, 40 mm, 45 mm, 50 mm, 55 mm, 60 mm, 65 mm, 70 mm, 75 mm, 80 mm, 85 mm, 90 mm, 95 mm, 100 mm, 105 mm, 110 mm, 120 mm, 130 mm, 140 mm, 150 mm, 160 mm, 170 mm, 180 mm, 190 mm, 200 mm, 220 mm, 240 mm, 260 mm, 280 mm, 300 mm, 320 mm, 340 mm, 360 mm, 380 mm, 400 mm, 450 mm, 500 mm, 550 mm, 600 mm, 650 mm, 700 mm, 750 mm, 800 mm, 850 mm, 900 mm, 950 mm, 1000 mm, 1150 mm, or 2000 mm. Optionally, the upper limit of the focal length range for a zoom lens suitable for the image capture device of the present disclosure may be less than or equal to any of the values described herein. The upper limit of the focal length range for a zoom lens suitable for the image capture device of the present disclosure may have a value falling within a range between any two of the values described herein.

In some instances, a focal length of a prime lens suitable for the image capture device of the present disclosure may be less than or equal to 1 mm, 2 mm, 4 mm, 5 mm, 6 mm, 8 mm, 10 mm, 12 mm, 14 mm, 16 mm, 18 mm, 20 mm, 22 mm, 24 mm, 26 mm, 28 mm, 30 mm, 35 mm, 40 mm, 45 mm, 50 mm, 55 mm, 60 mm, 65 mm, 70 mm, 75 mm, 80 mm, 85 mm, 90 mm, 95 mm, 100 mm, 105 mm, 110 mm, 120 mm, 130 mm, 140 mm, 150 mm, 160 mm, 170 mm, 180 mm, 190 mm, 200 mm, 220 mm, 240 mm, 260 mm, 280 mm, 300 mm, 320 mm, 340 mm, 360 mm, 380 mm, 400 mm, 450 mm, 500 mm, 550 mm, 600 mm, 650 mm, 700 mm, 750 mm, 800 mm, 850 mm, 900 mm, 950 mm, 1000 mm, 1150 mm, or 1200 mm. Optionally, the focal length of a prime lens suitable for the image capture device of the present disclosure may be less than or equal to any of the values described herein. The focal length of a prime lens suitable for the image capture device of the present disclosure may have a value falling within a range between any two of the values described herein.

In photography, the field of view (FOV) is that part of the world that is visible through the camera at a particular position and orientation in space; objects outside the FOV when the picture is taken are not recorded in the photograph. It is most often expressed as the angular size of the view cone, as an angle of view. For a normal lens, the field of view can be calculated as FOV = 2 arctan(d/2f), where d is the image sensor size and f is the focal length of the lens. For an image sensor having a fixed size, the prime lens may have a fixed FOV and the FOV may encompass a single FOV angle. For an image sensor having a fixed size, the zoom lens may have a variable FOV angular range and the FOV angular range may encompass a plurality of FOV angles.
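
For a concrete sense of the formula, the short Python snippet below evaluates FOV = 2 arctan(d/2f) for one assumed combination of sensor size and focal length; the specific numbers are examples only and are not taken from the disclosure.

```python
import math

def field_of_view_deg(sensor_size_mm, focal_length_mm):
    """Angle of view in degrees, from FOV = 2 * arctan(d / (2 * f))."""
    return 2 * math.degrees(math.atan(sensor_size_mm / (2 * focal_length_mm)))

# Illustrative values only: a sensor 6.17 mm wide behind a 4 mm lens.
print(round(field_of_view_deg(6.17, 4.0), 1))  # ~75.3 degrees
```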

In some instances, a FOV of a prime lens suitable for the image capture device of the present disclosure may be less than or equal to 170°, 169°, 165°, 160°, 155°, 150°, 145°, 140°, 135°, 130°, 125°, 120°, 115°, 110°, 105°, 100°, 95°, 90°, 85°, 80°, 75°, 70°, 65°, 60°, 55°, 50°, 45°, 40°, 35°, 30°, 25°, 20°, 15°, 10°, 5°, 3°, 2°, or 1°. Optionally, the FOV of a prime lens suitable for the image capture device of the present disclosure may be less than or equal to any of the values described herein. The FOV of a prime lens suitable for the image capture device of the present disclosure may have a value falling within a range between any two of the values described herein.

In some instances, a lower limit of the FOV angular range for a zoom lens suitable for the image capture device of the present disclosure may be less than or equal to 170°, 169°, 165°, 160°, 155°, 150°, 145°, 140°, 135°, 130°, 125°, 120°, 115°, 110°, 105°, 100°, 95°, 90°, 85°, 80°, 75°, 70°, 65°, 60°, 55°, 50°, 45°, 40°, 35°, 30°, 25°, 20°, 15°, 5°, 3°, 2°, or 1°. Optionally, the lower limit of the FOV angular range for a zoom lens suitable for the image capture device of the present disclosure may be less than or equal to any of the values described herein. The lower limit of the FOV angular range for a zoom lens suitable for the image capture device of the present disclosure may have a value falling within a range between any two of the values described herein.

In some instances, an upper limit of the FOV angular range for a zoom lens suitable for the image capture device of the present disclosure may be less than or equal to 170°, 169°, 165°, 160°, 155°, 150°, 145°, 140°, 135°, 130°, 125°, 120°, 115°, 110°, 105°, 100°, 95°, 90°, 85°, 80°, 75°, 70°, 65°, 60°, 55°, 50°, 45°, 40°, 35°, 30°, 25°, 20°, 15°, 5°, 3°, 2°, or 1°. Optionally, the upper limit of the FOV angular range for a zoom lens suitable for the image capture device of the present disclosure may be less than or equal to any of the values described herein. The upper limit of the FOV angular range for a zoom lens suitable for the image capture device of the present disclosure may have a value falling within a range between any two of the values described herein.

An image capture device 180 may be a high-resolution image capture device. In some embodiments, the image capture device may be a 4K image capture device capable of achieving 4K resolution. The image capture device may be capable of achieving a horizontal resolution of greater than or equal to about 1000 pixels, 2000 pixels, 2500 pixels, 3000 pixels, 3500 pixels, 4000 pixels, 4500 pixels, 5000 pixels, 5500 pixels, 6000 pixels, 7000 pixels, 8000 pixels, or 10000 pixels. In some instances, the image capture device may achieve a horizontal resolution of less than or equal to about 4000 pixels, 6000 pixels, 8000 pixels, 10000 pixels, 12000 pixels, 15000 pixels, 20000 pixels, or 30000 pixels. The horizontal resolution of any of the images captured may fall between any of the values described herein. A high resolution camera may be capable of storing, in a memory, images at the resolution at which they were captured. Alternatively, images may be stored at a lower resolution than that at which they were captured. The images may be stored in high resolution, such as any of the resolutions described herein. The images (e.g., video or still images) transmitted to a user terminal 110 may be at the resolution at which the images were captured or stored, or at a lower resolution than the resolution at which the images were captured or stored. In some instances, the resolution of the images transmitted to the user terminal may be less than about 100%, 95%, 90%, 80%, 70%, 60%, 50%, 40%, 30%, 20%, 10%, or 5% of the resolution at which the images are captured and/or stored. In some embodiments, transmitting the images at a lower resolution may advantageously reduce delay in data transmission and reduce the bandwidth taken up by the image data.

FIG. 2 is a flow chart illustrating a method 200 of processing a video on a user terminal in accordance with an embodiment of the disclosure.

In step 202, a first set of video data is received. In particular, the first set of video data may be received from an image capture device. Any description herein of receiving video data from an image capture device may include receiving the video directly from the image capture device, or receiving the video from an object, such as a UAV, that may support or include the image capture device. In some embodiments, the first set of video data may include live streaming video captured by the image capture device in real-time. In other instances, the first set of video data may include video that has been stored in memory and that does not need to be live or in real-time. The video may also be buffered at the user terminal. For example, a live feed may be provided to the user terminal at a time delay so that a user may interact with the data ahead of the live stream. Additionally, the video data may include one or more video clips.

The first set of video data may be captured by the image capture device at a first definition, and may be provided to the user terminal at a second definition. The first and second definitions may be the same or may be different. In some instances, the second definition may be lower than the first definition (e.g., the image having the first definition may have a higher resolution than the image having the second definition). In particular, the first set of video data may be reduced in definition prior to being transmitted from the image capture device to the user terminal so as to lessen the bandwidth required to transmit the video data and the computation required to process the video data once it is downloaded to the user terminal. As such, the first set of video data may be captured at a first definition and may be transmitted to the user terminal at a second definition that is lower than the first definition. In other examples, however, there may not be bandwidth restrictions on the transmitting and processing of video data. In these cases, the first definition and the second definition of the video data may be equal.
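
As a rough illustration of the bandwidth argument (with assumed example resolutions that do not come from the disclosure), the data rate for a given codec and frame rate scales approximately with the pixel count per frame, so downscaling the first set of video data greatly reduces what must be transmitted:

```python
# Assumed example figures: capture at 4K, transmit the preview at 720p.
first_definition = (3840, 2160)   # first definition (capture/storage)
second_definition = (1280, 720)   # second definition (preview sent to user terminal)

def pixels(resolution):
    width, height = resolution
    return width * height

ratio = pixels(first_definition) / pixels(second_definition)
print(f"Each 4K frame carries about {ratio:.0f}x the pixels of a 720p frame")  # ~9x
```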

At step 204, the first set of video data is edited. In particular, the first set of video data is edited at the user terminal, thereby forming a video edit request. A video edit request may include editing information that provides instructions for an image capture device to edit video associated with the video edit request. For example, the image capture device may store the captured video at a memory accessible by the image capture device, and may send a copy of the captured video to the user terminal in the form of a first set of video data. Then, when the user edits this first set of video data, the user may only need to provide instructions to the image capture device on how to edit video that is stored at the image capture device in order to fulfill a video edit request from the user terminal, since another copy of the video is accessible to the image capture device at the memory.

A user at a user terminal may interact with the user terminal to edit the video. For example, the user terminal may include a display that shows a graphical representation of the video to be edited. The graphical representation may include a timeline or other time-based features. The user may perform editing features, such as cutting a video clip, deleting portions of a video clip, adding a video clip, inserting a video clip, modifying a video clip, altering the speed of playback of a video clip (such as presenting the video clip in a slow motion mode or presenting the video clip in a fast playing mode), adding one or more visual effects or filters to a video clip, adding audio to a video clip, modifying audio of a video clip, or any other editing tasks, such as those described in greater detail elsewhere herein. The user terminal may track the edits made and form the video edit request. The user terminal may form the video edit request based on the final edited video at the user terminal. Any description herein of transmitting a video edit request or any other type of data to an image capture device may include transmitting the video edit request or other data directly to the image capture device, or transmitting the video edit request or other data to an object, such as a UAV, that may support or include the image capture device.
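
The content of such a request might resemble the following sketch. The schema, field names, and values are hypothetical; the disclosure requires only that the request describe the edited video (clips, order, trims, effects, audio), not that it carry the video data itself.

```python
# Hypothetical example of the editing information a video edit request could carry.
video_edit_request = {
    "version": 1,
    "segments": [
        # Ordered list of clips to keep, each trimmed by time.
        {"clip_id": "clip_007", "start_s": 3.0, "end_s": 9.5, "speed": 1.0},
        {"clip_id": "clip_002", "start_s": 0.0, "end_s": 4.0, "speed": 0.5},  # slow motion
    ],
    "effects": [{"segment_index": 1, "filter": "sepia"}],          # visual effect on one segment
    "audio": {"track": "background_music.mp3", "mix_level": 0.3},  # added audio
    "output_definition": "source",  # request the originally captured definition
}
```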

After the video edit request is formed based on editing the first set of video data at step 204, the video edit request is transmitted to the image capture device at step 206. In particular, the video edit request is based on the editing of the first set of video data and includes information related to how to edit video data that is associated with the first set of video data. For this example, the original captured video data that matches the first set of video data, and is stored by the image capture device, is considered to be associated with the first set of video data. As previously described, any description herein of storing or editing image data by the image capture device may apply to the camera itself or an object, such as a UAV that may support or include the camera.

At step 208, the user terminal receives a response to its video edit request. In particular, at step 208, the user terminal receives a second set of video data that corresponds to the video edit request. In one example, the second set of video data may be generated when the image capture device retrieves video associated with the video edit request, edits the video based on the video edit request, and provides a second set of video data that is responsive to the video edit request to the user terminal. The second set of video data may be provided at a third definition. In some embodiments, the third definition may be the same as the first definition at which the image was captured. In other instances, the third definition may be lower than the first definition at which the image was captured. The third definition may be higher than the second definition that was provided to the user terminal in the first set of video data.

FIG. 3 is a flow chart 220 illustrating a method of processing a video on an image capture device in accordance with an embodiment of the disclosure. In step 222, a video is captured at a first definition. In particular, the video is captured at a first definition by an image capture device. The image capture device may be on or part of a movable object, such as a UAV 100, and the video may be captured at a high definition. Once the video is captured, the video may be stored in a memory. The memory may be on-board the image capture device and/or movable object. Any description of memory herein may apply to one or more memory storage units which may be on a single object or distributed over multiple objects. In some embodiments, the captured video data may be stored in the memory at the first definition (e.g., high definition, such as the high resolutions described elsewhere herein). Alternatively, the captured video data may be stored in the memory at a different definition than the first definition (e.g., at a lower definition). In some instances, a single copy of the video data may be stored in the memory. Alternatively, multiple copies of the video data may be stored in the memory. The multiple copies may be at the same definitions or at different definitions. Additionally, a copy of at least a portion of the video may be provided to a user terminal as a first set of video data. The first set of video data may be provided at a second definition, where the second definition may be equal to the first definition or the second definition may be at a lower definition. When the first set of video data is provided at a lower definition, the transmitting of the first set of video data may require less bandwidth, thereby providing advantages in speed and efficient use of data network coverage.

As such, at step 224, the first set of video data is transmitted to a user terminal. Once the first set of video data is received at the user terminal, the user may interact with the data so as to create a video edit request. The video edit request may include information related to the first set of video data and instructions on how to edit the copy of the captured video that is stored at the memory. In this way, the user terminal may provide an editing file to the image capture device and request an edited video based on the first set of video information without having to send back the first set of video data itself. Instead, the user terminal may request that the image capture device retrieve corresponding video from the memory that has the same video clips as those requested in the editing file. In other examples, the video clips retrieved from the memory may have the same content as those requested in the video edit request, but may have a higher definition.

Once a video edit request has been generated at the user terminal and transmitted to the image capture device, the video edit request is received at the image capture device at step 226. After the video edit request is received, the image capture device may retrieve video clips associated with the video edit request and may generate a second set of video data based on the information in the video edit request at step 228. For example, the image capture device may retrieve selected video clips and arrange them in an order described in the video edit request. The image capture device may generate the second set of video data to mirror the edits that are made at the user terminal. The second set of video data may be provided at a different definition (e.g., higher definition) from the video edited at the user terminal. In some instances, the second set of video data may include editing features that were performed at the user terminal, such as cutting a video clip, deleting portions of a video clip, adding a video clip, inserting a video clip, modifying a video clip, altering speed of playback of a video clip, adding one or more visual effects or filters to a video clip, adding audio to a video clip, modifying audio of a video clip, or any other editing tasks, such as those described in greater detail elsewhere herein. In some instances, the second set of video data may be generated as a single video file with the edited features.
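
One way the device-side step could look is sketched below: each segment named in the request is looked up in the on-board memory (where the clips are stored at the first, higher definition), trimmed to the requested interval, and placed in the requested order to form the second set of video data. The function and field names are assumptions for illustration only.

```python
# Hypothetical device-side handling of a received video edit request.
stored_clips = {  # on-board memory: clips kept at the first (high) definition
    "clip_002": {"definition": "4K", "duration_s": 30.0},
    "clip_007": {"definition": "4K", "duration_s": 15.0},
}

def generate_second_set(edit_request, memory):
    second_set = []
    for segment in edit_request["segments"]:      # order is defined by the request
        clip = memory.get(segment["clip_id"])
        if clip is None:
            continue                              # skip clips no longer in memory
        second_set.append({
            "clip_id": segment["clip_id"],
            "definition": clip["definition"],     # high definition, taken from storage
            "start_s": segment["start_s"],
            "end_s": min(segment["end_s"], clip["duration_s"]),
        })
    return second_set

request = {"segments": [{"clip_id": "clip_007", "start_s": 3.0, "end_s": 9.5},
                        {"clip_id": "clip_002", "start_s": 0.0, "end_s": 4.0}]}
print(generate_second_set(request, stored_clips))
```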

Once the second set of video data has been generated at the image capture device, the second set of video data may be transmitted to the user terminal at step 230. Additionally, since the second set of video data may be retrieved from the memory, which stores video at a first definition, the second set of video data may be transmitted to the user terminal at a third definition that matches the first definition. Alternatively, the video clips included in the second set of video data may be extracted from the memory at the third definition, which may be different from the first definition, such that the second set of video data may be provided to the user terminal at a third definition that is less than the first definition. The third definition may be higher than the second definition at which the first set of video data was provided. In some instances, the third definition may be at a minimum of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, or 1000% higher resolution than the second definition.

The second set of video data may be transmitted to the user terminal at which the video editing occurred. Alternatively, the second set of video data may be transmitted to another device, in addition to the user terminal or instead of the user terminal. For instance, the second set of video data may be transmitted to a device of a different user (e.g., computer, laptop, or mobile device such as a smartphone or tablet) or of the same user. In some instances, the second set of video data may be uploaded to the cloud or to a file sharing site. The second set of video data may be uploaded to a video sharing site.

FIG. 4 is a flow chart 240 illustrating a method of processing a video in an image processing system in accordance with an embodiment of the disclosure. In step 242, video at a first definition is captured at an image capture device. The first definition may be a high definition, such as the resolution values described elsewhere herein. Additionally, the video may be stored at a memory of the image capture device. Additionally, a copy of the video may be sent to a user terminal as a first set of video data. At step 244, the first set of video data is provided to the user terminal. The user terminal may be remote from the image capture device. In one example, the user terminal may be a mobile device and the image capture device may be on a UAV. The user terminal and the image capture device may be physically separated from one another. In some instances, no physical connection is provided between the user terminal and the image capture device and/or UAV. The user terminal and image capture device may or may not be within line of sight of one another. The image capture device may be movable or may be supported by a movable object that may cause the distance and/or position of the image capture device to change relative to the user terminal. In some alternative embodiments, the user terminal may be coupled to the image capture device. For example, if the image capture device is a stationary camera, the stationary camera may be connected to a user terminal for a user to interact with the first set of video data.

At step 246, the first set of video data is received at the user terminal. Once the first set of video data is received at the user terminal, a user may interact with the video data and generate a video edit request. In particular, the video edit request may include information that a user may want to include in an edited video formed from video having the same or similar content as the first set of video data. Accordingly, at step 248 a video edit request is formed at the user terminal. The video edit request may include sufficient data to permit the image capture device to provide a second set of video data in accordance with edits made by the user at the user terminal using the first set of video data.

Once the video edit request is formed, the video edit request may be provided to the image capture device at step 250. The video edit request may be received at the image capture device at step 252. Once the video edit request has been received at the image capture device, the image capture device may generate a second set of video data based on the video edit request. In particular, the image capture device may identify video clips associated with the video edit request by accessing the memory. The memory may store video data that corresponds with the first set of video data. In some embodiments, the video data stored at the memory may be at a high definition. As such, the video clips retrieved from the memory may be at a high definition.

After retrieving the video clips that correspond to the video edit request, the image capture device may synthesize the video clips in an arrangement provided by the video edit request. In this way, the image capture device may generate the second set of video data (step 254) that is responsive to the video edit request received from the user terminal. At step 256, the image capture device may provide the second set of video data to the user terminal. At step 258, the user terminal may receive the second set of video data.

FIG. 5 illustrates an exemplary flow 300 of image-related data among image capturing components in an image processing system, in accordance with embodiments. In some embodiments, image-related data includes raw or processed image data as well as data extracted or derived from the image data, such as edited image data. As illustrated, raw image data can be captured by an image sensor 304. The image sensor may be in communication with one or more data storage devices 302. The image sensor may also be in communication with an image transmission module 306 and/or a pre-processing unit 308. The image transmission module may transmit data to a user terminal 312. The pre-processing unit may communicate with a processing unit 310, which may transmit data to the user terminal. In some embodiments, the communication between the image sensor 304 and the image transmission module 306 and the communication between the image transmission module 306 and the user terminal 312 may be bi-directional.

The image sensor 304 may be configured to convert optical signals into electronic signals. The image sensor may include semiconductor charge-coupled devices (CCD), active pixel sensors using complementary metal-oxide-semiconductor (CMOS) or N-type metal-oxide-semiconductor (NMOS, Live MOS) technologies, or any other types of sensors. The image sensor may be coupled to a movable object such as a UAV. For example, the image sensor may be part of an imaging device (e.g., camera) that is carried by a UAV with or without a carrier. The image sensor and/or imaging device may be configured to capture pictures, videos, or any other image data with any suitable parameters such as width, height, aspect ratio, megapixel count, resolution or quality, and the like. For example, the imaging device may be configured to capture high-definition or ultra-high-definition videos (e.g., 720p, 1080i, 1080p, 1440p, 2000p, 2160p, 2540p, 4000p, 4320p, and so on).

An image sensor 304 may be coupled to a UAV and may be utilized to capture image data while the UAV is in flight or otherwise engaged. The image sensor may be configured to capture image data while the UAV is in operation and/or when the UAV is powered on or off. In an example, the image data that is captured by an image sensor may be provided directly to a user terminal in real time. In another example, the image data may be stored and/or processed prior to providing the image data to the user terminal. In some examples, the image data provided to the user terminal may be at an equal or lower definition than the image data that is captured by the image sensor 304. In examples where the image data is provided to the user terminal at a lower definition, the demand for bandwidth may be lessened.

When the image data is not provided directly to the user terminal, the image data captured by the image sensor 304 can be stored in a data storage device 302. The image data may be accessed from the data storage device at a later time and provided to the user terminal. In some instances, image data from the image sensor may be simultaneously stored in the data storage device, pre-processed or processed, and/or transmitted to a user terminal.

The data storage device 302 may be based on semiconductor, magnetic, optical, or any suitable technologies and may include flash memory, USB drives, memory cards, solid-state drives (SSDs), hard disk drives (HDDs), floppy disks, optical disks, magnetic tapes, and the like. For example, the data storage device 302 can include removable storage devices that may be detachably coupled to an imaging device, such as memory cards of any suitable format, including PC Card, CompactFlash, SmartMedia, Memory Stick, Memory Stick Duo, Memory Stick PRO Duo, Miniature Card, Multimedia Card (MMC), Reduced Size Multimedia Card (RS-MMC), MMCmicro Card (MMCmicro), PS2 card, Secure Digital (SD) card, SxS, Universal Flash Storage (UFS), miniSD, microSD, xD-Picture Card, Intelligent Stick (iStick), Serial Flash Module (SFM), NT Card, XQD card, and the like. The data storage device 302 can also include external hard disk drives, optical drives, tape drives, floppy drives, and other suitable storage devices that may be operatively connected to the imaging device.

The image data captured by image sensor 304 may be stored at a data storage device 302 and, additionally, a copy of the image data captured by image sensor 304 may be transmitted to a user terminal 312 by an image transmission module 306. The image transmission module may permit wireless communications with the user terminal. The image data that is transmitted to the user terminal 312 may be shown to correspond with the image data that is stored at the data storage device 302, such as through the use of a source video identification. In some embodiments, the image data may be compressed or otherwise processed before being transmitted by the image transmission module 306. In other cases, the image data may not be compressed or processed before being transmitted. In particular, image data may be compressed so as to decrease the burden of transmitting image data across a network. This, in turn, may decrease the time it takes to download image data on a user's device. In some embodiments, the image data may be compressed so as to minimize the demand for wireless transmission of the image data. The image data may be compressed to be less than about 95%, 90%, 80%, 70%, 60%, 50%, 40%, 30%, 20%, 10%, or 5% of the original data. The transmitted image data may be displayed on the user terminal 312 so that a user operating the user terminal 312 can view the image data and/or interact with the user terminal 312 based on the image data.
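
By way of a rough, non-limiting sketch, the target encoding parameters for the transmitted copy could be derived from a desired fraction of the original data volume. The helper below is an assumption about one possible sizing rule, ignores codec-specific behaviour, and its name and values are illustrative only.

```python
def preview_bitrate(original_bitrate_bps: float, target_fraction: float) -> float:
    """Return a target bit rate so the transmitted copy is roughly `target_fraction`
    of the original data volume (e.g., 0.1 for about 10% of the original data)."""
    if not 0.0 < target_fraction <= 1.0:
        raise ValueError("target_fraction must be in (0, 1]")
    return original_bitrate_bps * target_fraction

# Example: a 40 Mbps high-definition recording transmitted as a preview at about 10% of the data.
preview = preview_bitrate(40_000_000, 0.10)   # -> 4,000,000 bps
```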

Optionally, the image data captured by the image sensor 304 can be pre-processed by a pre-processing unit 308. The pre-processing unit 308 can include any hardware, software, or a combination thereof. Examples of a pre-processing unit 308 can include a field programmable gate array (FPGA). The pre-processing unit 308 can be operatively coupled to the image sensor 304 to pre-process the raw image data before the image data is processed to extract specific pieces of information. Examples of tasks performed by the pre-processing unit 308 can include re-sampling to assure the correctness of the image coordinate system, noise reduction, contrast enhancement, scale space representation, and the like.

The image data, as processed by the pre-processing unit 308, can optionally be further processed by a processing unit 310 that is operatively coupled to the pre-processing unit 308 (e.g., via a general purpose memory controller (GPMC) connection). Alternatively, no further processing is required. The processing unit 310 can include one or more ARM processors. The processing unit 310 can be configured to perform any suitable embodiments of the methods described herein. Examples of tasks performed by the processing unit 310 may include feature extraction at any suitable level of complexity, image segmentation, data verification, image recognition, image registration, image matching, and the like. In some embodiments, the processing unit 310 produces auxiliary information related to image data that has been captured by image sensor 304. The auxiliary information may include, for example, the location, size, or other characteristics of a target within one or more images.

The auxiliary information as determined by the processing unit 310 can be provided to the user terminal 312 via a communication module. In some embodiments, the communication module may be separate from the image transmission module 306 described above. In other embodiments, the communication module may include or be included in the image transmission module 306. Any suitable means of communication can be used for the communication module and/or the image transmission module 306, such as wired communication or wireless communication. For example, the communication module and/or the image transmission module 306 can utilize one or more of local area networks (LAN), wide area networks (WAN), infrared, radio, WiFi, point-to-point (P2P) networks, telecommunication networks, cloud communication, and the like. Optionally, relay stations, such as towers, satellites, or mobile stations, can be used. Wireless communications can be proximity dependent or proximity independent. In some embodiments, line of sight may or may not be required for communications. In addition to tracking information, the communication module can also be configured to transmit and/or receive sensing data from other sensors onboard the movable object, positional and/or motion information determined by processing the sensing data, predetermined control data, user commands from remote control terminals, and the like.

In some embodiments, the image data, as provided by the image transmission module 306, may be augmented by or otherwise combined with the auxiliary information, as produced by the processing unit 310. The user may also interact with the user terminal based on the augmented image data. For example, the user may edit and/or create videos based on the image data or the augmented image data.

In some embodiments, the image data can be provided to the user terminal 312 in response to a demand by the user terminal 312. For example, the user terminal 312 may demand such image data only when a user selects a certain viewing mode (e.g., an active viewing mode that requests image data in real time, or a delayed viewing mode that requests image data that has been buffered so that a user may fast-forward through recent image data). Alternatively or additionally, the image data may be provided to the user terminal 312 without any demand by the user terminal 312. For example, the image data may be pushed to the user terminal 312 on a periodic basis (e.g., every 0.1 second, 0.2 second, 0.5 second, 1 second, or 2 seconds). In some embodiments, data may be pushed down through the image transmission module or other communication module. Alternatively, data may be pulled down by the user terminal, which may send one or more requests for data to the image transmission module or other communication module.

FIG. 6 illustrates an exemplary flow 400 of image-related data among image editing components in an image processing system, in accordance with embodiments. In particular, FIG. 6 illustrates a flow of image-related data that produces an editing file at the user terminal. An example of an editing file is a video edit request, which is discussed in greater detail with reference to FIG. 7.

As discussed with reference to FIG. 5, video captured at a first definition by an image capture device may be stored at a memory that is accessible to the image capture device. Additionally, a copy of at least a portion of the captured video is provided as a first set of video data to a user terminal. In particular, the first set of video data is provided at a second definition to a user terminal, such as user terminal 404 as illustrated in FIG. 6. Once the first set of video data is received at user terminal 404, user 402 may interact with user terminal 404 to edit the first set of video data. The interactions of user 402 with user terminal 404 may be used to produce an editing file that includes editing information 406. Editing information 406 can include characteristics of a desired video clip, such as estimated starting time, estimated stopping time, an area of an image to be highlighted/enlarged/minimized, and the like. Editing information may also include data about audio files, or may include the audio files themselves, to be combined with video data.

In some embodiments, editing information 406 can be generated when a user 402 edits video image data via a user interface provided by the user terminal 404. For example, the user terminal 404 may allow a user 402 to use a double click to select a starting point and an ending point to identify a desired video clip. Based on the user selections and edits, editing information 406 may be generated. When a video clip is selected by the user, the information (attribution) of the video clip may be recorded. For example, when a user selects a particular video segment by engaging a double click, that selection may be identified as an editing operation that initiates storage of the video identification of the video segment selection along with other characteristics of the editing. This information may be included in editing information 406, and may include at least one of a starting time in the video stream, an ending time in the video stream, a time of being selected, a description of the selected video clip, GPS information, a route of shooting, a format, a duration, a definition, etc. In some examples, the video attribution information associated with selected clips and editing operations may be stored in pre-configured files, such as a video edit request. Additionally or alternatively, the attribution information associated with a selected clip may be embedded within the video of interest.
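
A minimal sketch of how such attribution might be recorded when a clip is selected is given below. The field names (source_video_id, start_time, gps, and so on) are illustrative assumptions rather than a prescribed format.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class ClipAttribution:
    """Attribution recorded when a user selects a video clip (illustrative fields only)."""
    source_video_id: str            # identifies the source video stored on the capture device
    start_time: float               # relative starting time in the source stream, in seconds
    end_time: float                 # relative ending time in the source stream, in seconds
    selected_at: float              # time at which the selection was made
    description: str = ""           # free-text description of the selected clip
    gps: Optional[tuple] = None     # (latitude, longitude) where the clip was captured
    definition: str = "720p"        # definition of the preview the user edited
    duration: float = 0.0

@dataclass
class EditingInformation:
    """Clip selections and edits that will later form a video edit request."""
    clips: List[ClipAttribution] = field(default_factory=list)

    def add_selection(self, clip: ClipAttribution) -> None:
        clip.duration = clip.end_time - clip.start_time
        self.clips.append(clip)
```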

A user 402 may select one or more video clips from video data that is provided to user terminal 404 in various ways, including a single click, a double click, or pressing on the video stream. The way of selecting a video clip may be chosen by the user from an interface of an application that is run on the user terminal 404. In some embodiments, video data provided to the user may include a live stream from an image capture device. Video data may be provided to a user while an image capturing device is in operation and/or while a UAV is in flight. The video data provided to the user may also be video data from a data storage unit (e.g., memory). This may include recently shot video, or video that was captured further in the past. The user may be able to access the video stored in the data storage unit regardless of when the video was captured. For instance, the user may be able to access video that was just captured within the past several seconds, video captured within the past several hours, video captured within the past several days, video captured within the past several weeks, video captured within the past several months, or video captured within any other period of time.

If the user 402 wishes to select a video clip by single clicking the video stream, a video clip may be selected by extending a time period centered on the time of the single click. For example, when the user single clicks at 15:35 of the video stream, a video clip may be selected from 15:30 to 15:40, or from 15:33 to 15:43, or from 15:28 to 15:43, depending on a preset configuration. In additional examples, a user may select a video clip by extending a time period that is initiated at the time of the single click. For example, when the user single clicks at 16:40 of the video stream, a video clip may be selected from 16:40 to 16:50. In another example, a user may select a video clip by backwardly extending a time period that ends at the time of the single click. For example, when a user single clicks at 16:30 of the video stream, a video clip may be selected from 16:20 to 16:30.

Additionally, when the user wishes to select a video clip by double clicking the video stream, a video clip may be selected having a duration equal to the time period between the two clicks. Further, if the user wishes to select a video clip by pressing on the video stream, a video clip may be selected having a duration equal to the time period of the pressing. In another example, a certain advance time period (e.g., 2 s) may be added to a method of video clip selection. For example, if the user presses on the video stream at 15:35 and releases at 15:40, a video clip may be selected from 15:33 to 15:40. This additional two-second advance allows the user to capture video that the user may have been too slow to select at the precise moment the user wanted to begin the selection.
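
The click-based selection rules above can be summarized as a small helper that maps a gesture to a (start, end) window. The default window length, the centered/forward/backward modes, and the 2-second advance mirror the examples in the text; the function names themselves are assumptions.

```python
def clip_from_single_click(click_time: float, window: float = 10.0,
                           mode: str = "centered") -> tuple:
    """Return (start, end), in seconds, for a single-click selection."""
    if mode == "centered":                 # e.g., click at 15:35 -> 15:30 to 15:40
        start, end = click_time - window / 2, click_time + window / 2
    elif mode == "forward":                # e.g., click at 16:40 -> 16:40 to 16:50
        start, end = click_time, click_time + window
    else:                                  # "backward": click at 16:30 -> 16:20 to 16:30
        start, end = click_time - window, click_time
    return (max(start, 0.0), end)

def clip_from_press(press_time: float, release_time: float,
                    advance: float = 2.0) -> tuple:
    """Press-and-hold selection with a 2 s advance, e.g., 15:35-15:40 -> 15:33 to 15:40."""
    return (max(press_time - advance, 0.0), release_time)
```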

A user may be able to access any number of video clips stored in a data storage unit. The user may be able to see files or thumbnails of the various video clips that have been stored. The user may select one or more of the clips to edit. In some instances, information about the available video clips may be transmitted to a user terminal for the user to view and select. When a user selects one or more video clips, a set of video data corresponding to the video clip may be transmitted to the user terminal. The set of video data may be at a lower definition than the video data originally captured.

The editing information 406 that is produced by the user 402 interacting with the user terminal 404 may be used in editing the video clips. For example, when combining the selected video clips, the user may want to combine the video clips in the sequence in which they were selected (which may be different from their sequence in the original video stream), or the user may want to combine selected video clips having the same, or similar, GPS information. Additionally, as the user decides between video clips, not all of the selected video clips may be chosen to be combined into a final video. For example, the user may select some video clips from those selected clips according to certain criteria. Alternatively, a user may choose to apply a filter to certain video clips, or may choose to select only video clips that meet a threshold degree of brightness.

Accordingly, the user may choose whether or not to store selected video clips, or may choose to store only a subset of the selected video clips. Selected video clips may be stored in a directory such as “highlight clips”. Once a directory has been generated, the user may want to combine clips that are subsequently selected from this directory. The selected video clips may be stored in various formats such as MP4, AVI, or MOV. Alternatively, the selected video clips may not be stored, in order to save storage space. To lessen space constraints, the user may choose to store the information (attribution) of the selected video clips, rather than the full selected clips themselves, thereby forming a video edit request.

In some embodiments, the information (attribution) of the video clips of which the final video file is composed may be stored in the final video file. As such, the instruction to store this attribution information in the final video file may be included in the video edit request. In this way, when playing the final video file, the user may know the information (attribution) of the video clips, such as the GPS information of each clip.

Additionally, the user may want to further edit the selected video clips. For example, the user may want to further select a duration of video from a selected clip. This can be performed by dragging a time window, or by editing the information (attribution) of the video clip (e.g., editing a starting time or an ending time of the video clip).

In another example, the user may want to delete a selected video clip. This can be performed by removing the selected video clip from the directory, or by removing the information (attribution) of the video clip. The user may also want to add new video clips. This can be performed by selecting a new video clip from the stream, or by adding new information (attribution) of a video clip.

Additionally, users may add attribution information when editing video clips. The attribution of a selected video clip may include a rating from the user. For example, the user may add his rating (three stars, four stars, and so on) when selecting the video clip. Alternatively, the rating may be a default value when the video clip is selected, and the user may choose to revise it later. Further, the attribution of selected video clips may be stored together with the video clips in a directory or library, such that the user may further select clips in the directory or library at a later time.

After the user has stored selected video clips, such as in a directory or a library, the user may set an edit rule for further selecting video clips to be combined from the directory or library. The edit rule may be a preset rule or a custom rule. For example, a preset rule may be: extracting video clips with a capturing date ###. As another example, a preset rule may be: extracting video clips with a capturing location “riverside”. As yet another example, a preset rule may be: extracting video clips with a rating higher than four stars. The preset rule may be a combination of the above examples. The preset rule may be edited by the user.

Alternatively, the edit rule may be a custom rule created by the user. The user may set a custom rule by selecting from a menu of editing rules. For example, the user may set a custom rule as: extracting two clips with a rating higher than four stars, one clip with a capture location “forest”, and one clip with a capture date “today”. Based on the custom rule, the video clips meeting the edit rule may be further selected from the clip directory or library for further processing, such as combining.
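
An edit rule of this kind can be viewed as a predicate applied to the stored clip attribution. The sketch below is illustrative only; the criteria names (rating, location, capture date) simply mirror the examples above and are not a fixed schema.

```python
def select_by_rule(clips, min_rating=None, location=None, capture_date=None):
    """Filter a clip directory by a preset or custom edit rule (illustrative criteria)."""
    selected = []
    for clip in clips:                      # each clip is a dict of attribution values
        if min_rating is not None and clip.get("rating", 0) < min_rating:
            continue
        if location is not None and clip.get("location") != location:
            continue
        if capture_date is not None and clip.get("capture_date") != capture_date:
            continue
        selected.append(clip)
    return selected

# Example preset rule: clips rated higher than four stars with capture location "riverside".
library = [
    {"id": "a", "rating": 5, "location": "riverside", "capture_date": "2015-03-30"},
    {"id": "b", "rating": 4, "location": "forest",    "capture_date": "2015-03-31"},
]
picks = select_by_rule(library, min_rating=5, location="riverside")   # -> clip "a"
```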

Once the selected video clips are identified, the user may request that a final video be produced by simple combining. This combining may be in a sequence based on the time stamp of each video clip in the original stream, in the sequence in which the clips were selected by the user, in a sequence of rating, in a sequence of location, etc. Alternatively, the user may use a template for video combining and/or synthesizing. For example, the user may select a template from a template library, and then insert the selected video clips into the template. The template may include preset audio, subtitles and/or a transition effect (a short clip placed between two video clips).

The user may edit a template or may create a custom template. Alternatively, the user may select audio, subtitles and/or transition effects from an audio library, a subtitle library and/or a transition effect library when synthesizing the selected video clips. These customizations may be included in the video edit request.

Additionally, the user may preview the combined/synthesized video file and further revise it. For example, the user may re-arrange the sequence of the clips. In some embodiments, the user can select a “random sequence” button or shake the terminal in order to have the clips re-arranged. As another example, the user can add, delete or edit any clip to be included in the final video file. Once a final video has been composed, a video edit request that contains the editing information 406 associated with the final video can be provided to a processing system 408. For example, the editing file that includes editing information 406 may be provided to processing system 408 via a communication module (not shown). The processing system 408 may be onboard a movable object (e.g., UAV). The processing system may encompass an image capture device, a movable object, or both. In some embodiments, an image processing system may include a user terminal as well.

FIG. 7 illustrates an exemplary video edit request 500, which may be an editing file that may be generated at a user terminal of an image processing system, in accordance with embodiments. The video edit request 500 may be transmitted from a user terminal to an image processing device, such as image capture device 304 discussed above. The video edit request may be a file. The video edit request may be generated at a user terminal in response to a user's edits of video data.

The video edit request 500 may include editing information related to a first set of video data that is stored in, or accessible to, the image capture device. Accordingly, the editing information may be used by the image capture device to identify video that corresponds to the video edit request 500. In particular, the editing information may be used by the image capture device to identify video segments within the first set of video data. After identifying the video that corresponds with the video edit request, the image capture device may generate edited video data that complies with the video edit request.

As seen in FIG. 7, video edit request 500 may include segment specifications 502. Segment specifications provide information 504 related to a desired video segment, such as source video identification; a relative starting time of the video clip; a relative ending time of the video clip; filter effect; output size and frame rate; bit rate of the video stream; and comments or other items. For instance, the comments or other items may include supplementary information of interest about the desired video segment, e.g., a model of UAV, or flight information describing flight parameters when the video segment was captured. Once the information 504 associated with segment specification 502 has been provided to the image capture device, a processor of the image capture device may use the information 504 to identify video data that is stored at the image capture device and may edit the video data so as to be consistent with the specifications listed in segment specification 502. For example, a video edit request may include a plurality of segments that are to be arranged in a specific order, for example, if a user requests a compilation of multiple video clips.
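
One plausible representation of a segment specification and of a multi-segment edit request is sketched below as plain Python dictionaries; every key, identifier and value is an illustrative assumption rather than a defined protocol.

```python
segment_specification = {
    "source_video_id": "VID_0042",        # identifies the stored source video (hypothetical ID)
    "start_time": 75.0,                   # relative starting time of the clip, in seconds
    "end_time": 90.0,                     # relative ending time of the clip, in seconds
    "filter_effect": "sepia",
    "output_size": [3840, 2160],          # width and height of the requested output
    "frame_rate": 30,
    "bit_rate": 40_000_000,               # bits per second of the requested video stream
    "comments": {"uav_model": "example-model", "flight_altitude_m": 120},
}

video_edit_request = {
    "title": "sunset_compilation",
    "segments": [segment_specification],  # segments listed in the order they should appear
}
```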

The information 504 may include additional details about the clip, such as any visual filters or effects that may be applied to the clip. For example, if a text overlay is applied to the clip, or a visual image is superimposed over a portion of the clip, such details may be included in the information. Additionally, if any audio is added or modified, the video edit request may include information about the audio. For example, audio accompanying the image portion of a video file may be tracked to correspond to the edited video file. Similarly, the audio may be modified or added. For example, one or more segments of the accompanying audio may be muted, or a voice or musical overlay may be added. In some instances, the audio data may be provided from the image capture device and/or UAV. Source audio files may also be provided from third party sources (e.g., songs) or may be generated at the user terminal (e.g., a user may record the user's voice, which may be used to narrate a portion of the video). Such information may or may not be included with the video edit request.

FIG. 8 illustrates an exemplary flow 600 of edited image-related data among image generating components in an image processing system, in accordance with embodiments. An editing file, such as video edit request 500 as seen in FIG. 7, may be received at an image capture device 604. Image capture device 604 may be on-board a UAV or part of a UAV, such as a UAV 100 as seen in FIG. 1. Alternatively, the image capture device may be on any other type of movable object, such as a land-bound vehicle, aerial vehicle, water-based vehicle, or space-based vehicle. The image capture device may be carried on a handheld stabilizer, vehicle-supported stabilizer, or any other type of stabilizer. The image capture device may be worn by a human or animal. Once the image capture device 604 receives the editing file, image capture device 604 may access a memory 602 that stores image data associated with the editing file. Memory 602 may be located locally on image capture device 604, or memory 602 may be accessed remotely. In particular, memory 602 may be used to store video that was captured by an image capture device, such as image capture device 604, and provided to a user terminal. Further, the editing file may include editing information that relates to the video that was captured by the image capture device and sent to the user terminal. In particular, when the video is edited at the user terminal, the editing information associated with the video is described in the editing file and associated with the video that is stored at memory 602. As such, the image capture device does not need the set of video content that was edited at the user terminal in order to generate a requested edited video. Rather, the image capture device merely accesses a copy of the video that is associated with the editing request and edits that video accordingly.

One benefit of this approach is that the image capture device may edit video that is captured and stored at a high definition. In contrast, the video that is edited at the user terminal may be of a lower definition, for a number of reasons. For example, the user device may not be able to readily download high definition video. Additionally, a user may not want to use the bandwidth associated with high definition video for the purpose of merely forming a video edit request. The user may also want to avoid excess data fees associated with downloading high definition video. Further, the time that it takes to download high definition video may be significantly longer than the time that it takes to download lower definition video. For all of these reasons, the user may prefer to utilize lower definition video to form a video edit request. As such, the image capture device may receive an editing file, such as a video edit request, which provides instructions on how the image capture device is to edit video that has already been stored at memory 602.

In order to edit video that is stored at memory 602, the image capture device may retrieve video clips that are associated with the editing file. In particular, the image capture device may retrieve video clips based on attribution information that is provided in the editing file. For example, the video edit request may include segment specifications that indicate the video identification of the video that contains one or more selected video clips. The segment specification may also include attribution information that describes a starting point and an ending point of a desired video clip. In this way, the image capture device may retrieve video clips that are associated with the video edit request from memory 602.

In another example, the image capture device may retrieve video clips based on attribution information that is embedded in the video. The attribution information may be embedded by the processor of the image capture device at the time of capturing the data, or the attribution data may be embedded into the video based on instructions from a video edit file. Further, the image capture device may find and extract video clips that are indicated by the attribution information that is embedded in the video, such as upon detection of a video synthesis command. Video synthesis may include the merging of video clips. After the video clips have been identified and extracted, the image capture device may synthesize the video clips into a video file. In another example, the image capture device may adjust the resolution of video clips when extracting the video clips from the video based on the attribution information. Additionally, the extracted video clips may be synthesized based on the starting points of the video clips.

Once the selected video clips have been retrieved, the video clips associated with the editing file may be provided from memory 602 to processor 606. Processor 606 may be configured to edit the video clips based on the editing file. In particular, processor 606 may be used to synthesize the selected clips into a final video. The video clips may be synthesized chronologically, or may be organized based on additional instructions provided in the editing file. In one example, the image data may be edited to provide a plurality of video clips that are arranged in an order that corresponds to instructions within a video edit request provided to image capture device 604 from a user terminal. For example, the video edit request may specify the way that each video clip is arranged within a pre-configured template. For instance, a music template may be provided that organizes video segments based on the rhythm of the music. Using a music template, the music may be organized into several segments such that a user may fill in time slots of the musical segments with different video clips. Based on these instructions, processor 606 may place each of the selected video clips into the place assigned by the instructions in the editing file.
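
A highly simplified sketch of this synthesis step follows: the processor walks the ordered segment list, extracts each clip from the stored full-definition video, and concatenates the results. The callables read_source, extract_clip and concatenate are placeholders for whatever decoding and muxing pipeline the device actually uses; they are assumptions, not part of the disclosure.

```python
def synthesize(edit_request, read_source, extract_clip, concatenate):
    """Assemble the second set of video data from an ordered edit request (illustrative only).

    The three callables stand in for the device's actual pipeline:
      read_source(video_id)      -> handle to the stored full-definition video
      extract_clip(src, t0, t1)  -> clip covering [t0, t1] seconds of src
      concatenate(clips)         -> single merged video object
    """
    clips = []
    for seg in edit_request["segments"]:      # segments are listed in the requested order
        src = read_source(seg["source_video_id"])
        clips.append(extract_clip(src, seg["start_time"], seg["end_time"]))
    return concatenate(clips)
```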

In another example, the selected video clips may be further edited by processor 606 after the video clips have been retrieved from memory 602. For example, instructions in the video edit request may specify that one or more of the selected video clips be edited to include a filter. Alternatively, the instructions in the video edit request may provide that certain video clips be presented at a higher playback speed than other video clips. These characteristic differences in the presentation of certain video clips may be encoded by editing instructions and executed using processor 606.

In this way, the plurality of video clips may be organized as a second set of image data. Once the image data has been edited into the second set of image data, the image data may be provided from processor 606 to a user terminal 610 via image transmission module 608. The second set of image data may include multiple video clips, or may be formed as a single video file that may encompass the multiple video clips.

FIG. 9 is a flow chart illustrating a method of processing a video at a user terminal in accordance with an embodiment of the disclosure.

In step 3-102, a determination may be made at the user terminal on whether a video captured by an image capture device is to be processed.

The image capture device may be a device having video capture capability. In some embodiments, the image capture device may be on or part of a movable object such as an aerial vehicle, a ground vehicle or a mobile phone held by a user. Alternatively, the image capture device may be a stationary object such as a camera installed on a building. The user terminal may be a device which can communicate with the remote image capture device via a wireless link or a cable. In some embodiments, the user terminal may be a smartphone. The user terminal may have one or more applications installed thereon. Alternatively, no separate applications are provided on the user terminal. In some instances, the user terminal may have a video editing application installed thereon. The user terminal may be a remote controller. In some embodiments, physical buttons may be provided on the user terminal, by which the user may input signals indicative of operation commands. Alternatively, soft buttons or virtual buttons may be displayed by a software application running on the user terminal. The user may input signals indicative of operation commands by pressing or touching the soft buttons. In some embodiments, the user terminal may include a display. The user terminal may include a touchscreen.

In some embodiments, the step 3-102 may comprise receiving an input from a user at the user terminal indicative of a video processing request. The user may input his/her decision on whether the video captured by an image capture device such as a UAV is to be processed by pressing one or more buttons provided on the user terminal. In some instances, the user may operate a wireless remote controller and input his/her decision by pressing one or more physical buttons. Alternatively, the user may operate a smart phone having no physical buttons, and input his/her decision by pressing a soft button displayed by a software application running on the smart phone. For instance, a user may provide an input to a user terminal to perform an edit function on a video. The input to perform the edit function may be an indication that the video captured by the image capture device is to be processed. A determination may be made that a video captured by an image capture device is to be processed when the user provides an input indicating an editing function to be performed on the video. In alternate embodiments, one or more processors, without requiring human intervention, may provide an indication of an editing function to be performed on a video.

In some embodiments, the processing to be performed on the captured video may include merging one or more video clips to the captured video. The one or more video clips may include a prologue and/or an epilogue. As used here, a prologue may be a video clip to be added at the beginning of a video, and an epilogue may be a video clip to be added at the end of a video. Optionally, the processing to be performed on the captured video may include adding one or more transition effects to the captured video. As used here, a transition effect may be a video clip to be added in the middle of a video. Optionally, the processing to be performed on the captured video may include applying one or more filter effects to the captured video. Processing of the video may include shortening a video clip, deleting a portion of a video clip, modifying or adding audio data, or performing any of the other editing functions described elsewhere herein. Merging of video clips is described herein as an example, but any other processing functions may occur.

If the determination in step 3-102 is YES, then in step 3-104, one or more video clips to be merged to the video captured by the image capture device may be selected at the user terminal, thereby forming a video edit request. If the determination in step 3-102 is NO, then the processing remains in step 3-102.

In some embodiments, the one or more video clips may include at least one of a prologue, an epilogue and a transition effect. The one or more video clips may be selected from a memory device installed on the image capture device. In some instances, the memory device may be a non-transitory computer readable medium that may include one or more memory units (e.g., removable media or external storage such as a Secure Digital (SD) card, a random access memory (RAM), a read only memory (ROM) or a flash memory) onboard a UAV and/or image capture device. Alternatively, the one or more video clips may be selected from a memory device installed on the user terminal. Optionally, the one or more video clips may be selected from an online video library, such as a video storage in the cloud. In any of these cases, the video clips stored in the memory device installed on the image capture device, the memory device installed on the user terminal, or the online video library may be prepared before step 3-102.

In some embodiments, the one or more video clips may be selected by playing one or more video clips, which may be stored in a memory device installed on the image capture device or on the user terminal, or in an online library. For example, the user may play one or more video clips that are stored in an SD card onboard a UAV and select a video clip having a riverside scene as the prologue. Alternatively, the one or more video clips may be selected by viewing descriptions of one or more video clips. In some instances, the description may be information describing content, a category, a location, a duration, and/or a rating of a video clip. For example, the user may select a video clip showing a riverside scene as the prologue by checking a corresponding description under a “riverside” category.

In some embodiments, the step 3-104 may comprise receiving an input from the user at the user terminal indicative of a selection of one or more video clips. In some instances, the user input may be made by pressing a physical button of the user terminal, such as a wireless remote controller. Optionally, the user input may be made by pressing a soft button that is displayed on the user terminal, such as a smart phone. For instance, the user may operate a joystick or press one or more buttons to select a video clip having a “riverside scene” from an online video library and determine to use this video clip as the prologue.

In some embodiments, the step 3-104 may comprise receiving a template. In some instances, the template may be a system default template. Alternatively, the template may be a template selected by the user from a plurality of preset templates. The plurality of preset templates may be stored in a memory device installed on the image capture device such as a UAV. Optionally, the plurality of preset templates may be stored in a memory device installed on the user terminal such as a smart phone. Optionally, the plurality of preset templates may be stored in an online template library, such as a template storage in the cloud.

In some instances, the template may comprise at least one of a prologue, an epilogue and a transition effect. Optionally, the template may further comprise at least one of an audio and a subtitle. For instance, a user-selected template may comprise one prologue, two transition effects and an audio. In this case, the video captured by the UAV may be processed by accommodating the captured video in the template, such that the processed video contains the prologue, the video, the transition effects and background music.

In some instances, the user may edit the system default template or the online templates, such as by modifying at least one of a prologue, an epilogue, a transition effect, an audio and a subtitle of the template. Optionally, the user may create a new template by determining at least one of a prologue, an epilogue, a transition effect, an audio and a subtitle. The user-created template may be stored as the system default template or as an online template.

In some embodiments, the step 3-104 may comprise determining the one or more video clips as at least one of a prologue, an epilogue and a transition effect. A prologue may indicate the video clip is to be added to the front of the video. An epilogue may indicate the video clip is to be added to the end of the video. A transition effect may indicate that the video clip is to be inserted into a video. Upon completing the step 3-104, at least one of a prologue, an epilogue and a transition effect may be determined.

Optionally, the step 3-104 may comprise selecting a filter effect to be applied to the one or more video clips. For instance, the filter effect may include one or more effects selected from basic tuning filters, color filters, stylish filters, geometric filters, and the like. Examples of basic tuning filters include brightness, exposure, contrast modification, white balance, highlighting, shadow, contrast, hue, sharpness, saturation, noise reduction, and gamma. Examples of color filters include color inversion, thermal representation using color, grayscale, color modification, old film coloring, sepia, and color LUT. Examples of stylish filters include cartoons, sketching, blurring, stretching, pixelation, posterization, single color and grayscale, golden, suiboku, suicai, and oil painting. Examples of geometric filters include bulge, swirl, explosion, sphere, and lens vignetting. The filter effects may be selected by the user from a list of filter effects, by pressing a physical button on the user terminal, or by pressing a soft or virtual button as displayed on the user terminal.
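
As a concrete illustration of one basic tuning filter, brightness and contrast adjustment can be expressed as a per-pixel linear transform; the NumPy sketch below is an assumed implementation for illustration only, not the filter code of any particular device.

```python
import numpy as np

def adjust_brightness_contrast(frame: np.ndarray, brightness: float = 0.0,
                               contrast: float = 1.0) -> np.ndarray:
    """Apply a linear brightness/contrast filter to an 8-bit RGB frame."""
    out = frame.astype(np.float32) * contrast + brightness
    return np.clip(out, 0, 255).astype(np.uint8)

# Example: brighten a synthetic 4x4 gray frame and raise its contrast slightly.
frame = np.full((4, 4, 3), 100, dtype=np.uint8)
filtered = adjust_brightness_contrast(frame, brightness=20, contrast=1.2)
```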

In step 3-104, a video edit request may be generated at the user terminal after selecting the one or more video clips to be merged to the video captured by the image capture device. The video edit request may be a small file containing at least information indicative of the prologue, the one or more transition effects and the epilogue to be merged, as selected by the user in step 3-104.

In step 3-106, the video edit request as formed in step 3-104 may be transmitted to the image capture device to cause the image capture device to process the video by merging the one or more video clips with the video.

The video edit request may be transmitted to the image capture device from the user terminal. Since the size of the video edit request is small, transmission of the video edit request may occupy less bandwidth. For instance, a transmission of the video edit request may utilize one or more of cable, local area networks (LAN), wide area networks (WAN), infrared, radio, WiFi, point-to-point (P2P) networks, telecommunication networks (e.g., a 2G link, a 3G link, or an LTE link), cloud communication, and the like. In some instances, the transmission of the video edit request may be encoded and/or encrypted for security reasons.

The image capture device, such as a UAV, may receive and analyze the video edit request by extracting the recorded information, and perform corresponding video processing on the captured video. In some instances, the processing performed by the image capture device may include at least merging at least one of a prologue, an epilogue and a transition effect, as described in the video edit request, with the captured video. For example, the video edit request may record a prologue and an epilogue to be merged to the captured video by describing an ID and a directory of the prologue and epilogue. The image capture device may merge the prologue and the epilogue by adding the prologue and epilogue to the beginning and the end of the captured video, respectively. As another example, the video edit request may record a transition effect to be merged to the captured video by describing an ID of the transition effect, a directory of the transition effect, and a relative start time of the transition effect. The image capture device may merge the transition effect by inserting the transition effect into the middle of the captured video at the relative start time.
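
The merging operation can be sketched as simple timeline surgery: the prologue is prepended, the epilogue is appended, and each transition effect is spliced in at its relative start time. The segment representation and the function below are illustrative assumptions, not the device's actual merging code.

```python
def merge_clips(captured_segments, prologue=None, epilogue=None, transitions=()):
    """Merge a prologue, transition effects and an epilogue with a captured video.

    captured_segments : list of (start_s, end_s, payload) pieces of the captured video
    transitions       : iterable of (relative_start_s, payload) from the edit request
    Returns the ordered list of payloads making up the merged timeline.
    """
    timeline = list(captured_segments)
    for rel_start, effect in sorted(transitions, key=lambda t: t[0]):
        for i, (s, e, seg) in enumerate(timeline):
            if s <= rel_start < e:
                # split the captured segment at the relative start time and insert the effect
                timeline[i:i + 1] = [(s, rel_start, seg),
                                     (rel_start, rel_start, effect),
                                     (rel_start, e, seg)]
                break
    merged = [payload for _, _, payload in timeline]
    if prologue is not None:
        merged.insert(0, prologue)       # prologue goes at the beginning of the video
    if epilogue is not None:
        merged.append(epilogue)          # epilogue goes at the end of the video
    return merged

# Example: a 60 s capture with one transition at 42.5 s, plus prologue and epilogue clips.
merged = merge_clips([(0.0, 60.0, "captured")], prologue="intro",
                     epilogue="outro", transitions=[(42.5, "fade")])
# -> ["intro", "captured", "fade", "captured", "outro"]
```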

In some embodiments, the method of processing a video at a user terminal in accordance with an embodiment of the disclosure as shown in FIG. 9 may further comprise receiving the merged video from the image capture device. The merged video may be a high definition video file to which the one or more video clips are merged as a prologue, a transition effect and an epilogue. In some instances, the definition of the merged video may be identical to the definition at which the video was captured by the image capture device, such as 4K. Alternatively, the definition of the merged video may be lower than the definition at which the video was captured by the image capture device. Optionally, the definition of the merged video may be higher than the definition at which the video was captured by the image capture device, by means of interpolation. In some instances, a format of the merged video may be MP4 (Moving Picture Experts Group Audio Layer IV), AVI (Audio Video Interleave), MOV (QuickTime Movie Format), or the like.

FIG. 10 is an example of content in the video edit request as formed at a user terminal in accordance with an embodiment of the disclosure. The content in the video edit request as shown in FIG. 10 is for illustrative purposes only, and may vary according to various protocols. As described hereinabove, the edit request may be a file describing at least the prologue, the one or more transition effects and the epilogue as selected by the user.

In the example shown in FIG. 10, the video edit request may at least comprise the description of a prologue, one or more transition effects and/or an epilogue to be merged to the captured video. In some embodiments, the description may include at least the information sufficient for identifying the video clips for the prologue, transition effect and/or epilogue and the information for defining an output parameter of the video clips. For instance, the video edit request may at least comprise a video edit request file title, a video ID, a video directory for identifying a storage location, a filter effect to be applied, an output size and frame rate of the video clips, and a bit rate of the video clips.
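
A plausible serialization of such a request, covering one prologue, one transition effect and one epilogue, is sketched below; the field names, identifiers and directory paths are assumptions that would in practice vary with the protocol in use.

```python
# Illustrative content of a video edit request (keys and paths are hypothetical).
video_edit_request = {
    "file_title": "edit_request_0001",
    "prologue": {
        "video_id": "clip_riverside",
        "video_directory": "/clips/prologue/",   # storage location of the clip
        "filter_effect": "none",
        "output_size": "1920x1080", "frame_rate": 30,
        "bit_rate": 20_000_000,
    },
    "transition_effects": [{
        "video_id": "clip_fade",
        "video_directory": "/clips/transition/",
        "relative_start_time": 42.5,              # seconds into the captured video
        "filter_effect": "none",
        "output_size": "1920x1080", "frame_rate": 30,
        "bit_rate": 20_000_000,
    }],
    "epilogue": {
        "video_id": "clip_sunset",
        "video_directory": "/clips/epilogue/",
        "filter_effect": "sepia",
        "output_size": "1920x1080", "frame_rate": 30,
        "bit_rate": 20_000_000,
    },
}
```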

The content of the video edit request may vary according to different protocols. In some instances, the content of the video edit request may also vary according to a demand of the user. For instance, if more than one transition effect is to be merged into the captured video, more segments for transition effects may be recorded in the video edit request.

FIG. 11 is a flow chart illustrating a method of preparing video clips for a prologue, a transition effect and/or an epilogue at a user terminal in accordance with an embodiment of the disclosure. The method of preparing video clips as shown in FIG. 11 may be performed at the user terminal to prepare video clips to be used as a prologue, a transition effect and/or an epilogue, and may optionally be performed before the method of processing a video at a user terminal as shown in FIG. 9, as discussed hereinabove.

In step 3-302, a determination may be made on whether a video clip for a prologue, transition effect and/or epilogue is to be captured on site. In some embodiments, the user may want to capture a video clip of the current scene and store it such that this video clip may later be used as a prologue, a transition effect and/or an epilogue. The determination may be made by the user by pressing a physical button on the user terminal, such as a remote controller or a smart phone. Alternatively, the determination may be made by pressing a soft or virtual button as displayed on the user terminal.

If the determination in step 3-302 is YES, then in step 3-304, a video clip of the current scene may be captured by the image capture device, such as a camera, a UAV carrying a camera unit, or a mobile phone having a camera module. The current scene may be a scene of the environment. In some instances, the user may control a direction, a zoom and/or an effect of a camera unit of the image capture device by sending instructions through the user terminal. The captured video clip may be stored for future use. In some instances, the captured video clip may be stored in a memory device installed on the image capture device. Alternatively, the captured video clip may be transmitted and stored in a memory device installed on the user terminal through a wireless link or a cable. Optionally, the captured video clip may be uploaded to an online video library, such as a video storage in the cloud. In some instances, when the captured video clip is transmitted to the user terminal, the captured video clip may be merged with a video on the user terminal. For instance, as the video clip is captured, the video clip may be merged with an identified video. Alternatively, the video clip may be accessed from a memory and merged at a future time.

In some instances, the captured video clip may be transmitted, uploaded and/or stored in a format of YUV, Y′UV, YCbCr, YPbPr, H264, H265, MP4 (Moving Picture Experts Group Audio Layer IV), AVI (Audio Video Interleave), MOV (QuickTime Movie Format), or the like.

If the determination in step 3-302 is NO, then in step 3-306, existing video clips for a prologue, transition effect and/or epilogue may be prepared. In some embodiments, the user may browse a plurality of existing video clips that are locally stored, select an existing video clip, and edit the existing video clip into a video clip appropriate for a prologue, a transition effect or an epilogue. For instance, the user may select a video clip that is locally stored on the user terminal, and edit the video clip into a clip having a duration of 5 seconds by dragging a time window. Optionally, the prepared one or more video clips may be stored in a local directory for future use as a prologue, a transition effect or an epilogue. Alternatively, the prepared one or more video clips may be uploaded and stored in a memory device installed on the image capture device, or in an online video library, such as a video storage in the cloud.

Once step 3-304 or step 3-306 is finished, the method of preparing video clips for a prologue, a transition effect and/or an epilogue may end in step 3-308. Subsequently, a merging process may occur.

FIG. 12 is a flow chart illustrating a method of merging a prologue, a transition effect and/or an epilogue at an image capture device in accordance with an embodiment of the disclosure. The method of merging a prologue, a transition effect and/or an epilogue may be performed at the image capture device after the method of processing a video at a user terminal as discussed hereinabove with reference to FIG. 9.

In step 3-402, the image capture device may capture a video. The image capture device may include a camera unit. In some instances, the camera unit may include an optical sensor and one or more lenses for directing light onto the optical sensor. In some embodiments, the captured video may be stored in a memory device such as an SD card onboard the image capture device. In some instances, the captured video may be stored in the memory device in a FIFO (First In, First Out) manner, such that only a predetermined length of video may be stored.
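
The FIFO behaviour can be sketched with a fixed-length queue of short encoded segments, where writing a new segment automatically discards the oldest one. The one-second segment granularity below is an assumption for illustration, not a requirement of the disclosure.

```python
from collections import deque

class VideoFifo:
    """Keep only the most recent `max_seconds` of video, one-second segments at a time."""
    def __init__(self, max_seconds: int):
        self.segments = deque(maxlen=max_seconds)   # oldest segment is dropped automatically

    def write_segment(self, encoded_second_of_video: bytes) -> None:
        self.segments.append(encoded_second_of_video)

    def stored_duration(self) -> int:
        return len(self.segments)

# Example: retain only the last 300 seconds of recording in the on-board buffer.
buffer = VideoFifo(max_seconds=300)
buffer.write_segment(b"...")   # placeholder for one second of encoded video
```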

In step 3-404, a video edit request may be received from a userterminal. The video edit request may be formed from a selection of oneor more video clips to be merged to the captured video when adetermination is made at the user terminal that the captured video is tobe processed. The video edit request may be formed at the user terminal,and a processing of forming the video edit request is described in step3-104 of FIG. 9, as discussed hereinabove. The video edit request maycontain at least information indicative of the prologue, the one or moretransition effects and the epilogue that is to be merged to the capturedvideo. An example of content in the video edit request is described inFIG. 10, as discussed hereinabove. The video edit request may occur anytime after the video has been captured. This may include while the videois being live-streamed, within seconds after the video is captured,within hours after the video is captured, within days after the video iscaptured, within months after the video is captured, or any other timeafter the video is captured. A user may view the captured video and makea determination to edit the video, at any time. The user may make thedetermination to edit the video while a UAV supporting the image capturedevice used to capture the video is in flight. A user may be capable ofediting a video while a UAV is in flight (e.g., the same flight duringwhich the video was captured, or a later flight).

In step 3-406, the one or more video clips, which are identified fromthe received video edit request, may be merged to the video captured bythe image capture device. In some embodiments, the process of the step3-406 may include at least adding a prologue and/or an epilogue to abeginning or an end of the captured video and/or inserting a transitioneffect to a middle of the captured video, based on the video editrequest. The video clips merged with the video captured by the imagecapture device may also be captured by the same image capture device.Alternatively, the video clips may be captured using a different imagecapture device. In other instances, the video clips may be generatedusing a device (e.g., computer-generated).

In some embodiments, a sequence of performing the step 3-402 and step3-404 may be alternated. For example, the video edit request may bereceived at the image capture device before a video is captured.

In some embodiments, the method of merging a prologue, a transitioneffect and/or an epilogue at an image capture device as shown in FIG. 12may comprise storing the merged video in a memory device. In someinstances, the merged video may be stored in a memory device installedon the image capture device, such as a SD card, or stored onboard a UAV.Alternatively, the merged video may be stored a memory device installedon the user terminal. Optionally, the merged video may be stored anonline video library in the cloud. In some instance, the merged videomay be stored in more than one memory devices as discussed hereinabove.

In some embodiments, the method of merging a prologue, a transitioneffect and/or an epilogue at an image capture device as shown in FIG. 12may comprise transmitting the merged video to the user terminal. In someinstances, the merged video may be transmitted in a high definition,such as 4K, or any of the other resolutions described elsewhere herein.In some embodiments, the merged video may be transmitted in definitionidentical to a definition in capturing the video by the image capturedevice. Alternatively, the merged video may be transmitted at adefinition lower than a definition in capturing the video by the imagecapture device. Optionally, the merged video may be transmitted indefinition higher than a definition in capturing the video by the imagecapture device, by means of interpolation. In some embodiments, a usermay view, at a user terminal, the video captured by the image capturedevice and the one or more video clips to be merged with the video at adefinition that may be equal to or lower than the definition at whichthe video and/or video clips are originally captured or created. Afterthe user edits the videos, a video edit request may be sent to an imagecapture device or other external device or cloud storage. In response tothe video edit request, a merged video, in accordance with the useredits made, may be provided to the user terminal. The merged video maybe at a resolution less than or equal to the resolution at which thevideo and/or video clips are originally captured or created. The mergedvideo may be at a resolution greater than or equal to resolution atwhich the videos were edited on the user terminal. In some instances,the merged video may be transmitted in a format of MP4 (Moving PictureExperts Group Audio Layer IV), AVI (Audio Video Interleave), MOV(Quicktime Movie Format), or the like.

FIG. 13 is a flow chart illustrating a method of merging a prologue, atransition effect and/or an epilogue at an image capture device inaccordance with an embodiment of the disclosure. This method of FIG. 13may comprise a plurality of processing as performed in step 3-406 ofFIG. 12, as discussed hereinabove.

In step 3-502, a prologue may be merged. In some embodiments, a videoclip that is identified based on the video edit request may be added toa beginning of the video to be captured by the image capture device. Forinstance, a video clip that is a scene of “river side” may be identifiedbased on the video edit request, fetched from an online video library,and used as the prologue to the video to be captured by the imagecapture device such as a UAV.

In step 3-504, a video may be captured by the image capture device, suchas a UAV carrying a camera unit. In some embodiments, the captured videomay be stored in a memory device such as a SD card onboard the imagecapture device.

In step 3-506, a determination may be made on whether a transitioneffect is a video clip of the current scene. A transition effect may beidentified by analyzing the video edit request received from the userterminal. For instance, the video edit request may contain informationon a relative start time of a transition effect. In this case, at therelative start time, e.g., 15 seconds from the starting of the capturedvideo, a video clip for transition effect may be inserted as atransition effect.

In some embodiments, the transition effect may be a video clip to befetched from a memory device installed on the image capture device suchas a SD card onboard a UAV.

Alternatively, the transition effect may be a video clip to be fetchedfrom a memory device installed on the user terminal. Optionally, thetransition effect may be a video clip to be downloaded from an onlinevideo library in the cloud.

In other embodiments, the transition effect may be a video clip oncurrent scene. For instance, the user may want to apply certain filtereffect to the current scene and use this effected video clip astransition effect.

If the determination in step 3-506 is YES, that is, a video clip oncurrent scene may be used as transition effect, then in step 3-508, avideo clip of current scene may be captured and used as the transitioneffect. In some embodiments, the step 3-506 may include applying one ormore filter effects to the video clip of current scene. In someinstances, the video edit request received from the user terminal mayrecord the information for instructing the image capture device to usethe video clip of current scene as the transition effect, and theinformation on filter effect to be applied on the video clip of currentscene. Alternatively, the user may send an instant instruction from theuser terminal, instructing the image capture device to use the videoclip of current scene as the transition effect, as well as theinformation on filter effect to be applied on the video clip of currentscene.

If the determination in step 3-506 is NO, that is, an external videoclip may be used as transition effect, then in step 3-510, a video clipof current scene may be fetched from a memory device based on the videoedit request. In some instances, the video edit request received fromthe user terminal may record the information on a location of thedesired video clip for the transition effect, and/or the information onfilter effect to be applied on the video clip. In this case, the videocapturing of the image capture device may be suspended, and the externalvideo clip may be received and merged as the transition effect. Thevideo capturing of the image capture device may resume after acompletion of receiving and merging the external video clip as thetransition effect.

In step 3-512, an epilogue may be merged. In some embodiments, a videoclip that is identified based on the video edit request may be added toan end of the video being captured by the image capture device. Forinstance, a video clip that is a scene of “forest” may be identifiedbased on the video edit request, fetched from an SD card onboard theimage capture device, and used as the epilogue to the video beingcaptured by the image capture device such as a UAV. To this end, amerged video may be produced.

In some embodiments, a sequence of performing the steps in the method asdiscussed hereinabove in FIG. 13 may be changed. For example, the stepof merging a prologue, merging a transition effect, and/or merging anepilogue may be performed after a video is completely captured at theimage capture device. In this case, the video clip of current scene ascaptured in step 3-508 may be buffered or stored until a video iscompletely captured by the image capture device. In some embodiments, auser may choose to perform one, two, or all three of a prologue merging,transition effect, and/or epilogue merging. In some instances, a singlevideo clip, or multiple video clips may be merged with a video. Thus,one or more steps of FIG. 13 may not need to occur, or may occur indifferent orders. In some instances, the video clips to be merged withthe video may come from the same source (e.g., from the same memory ordata storage unit), or may be provided from different sources (e.g.,some video clips may come from an image capture device, some video clipsmay come from a cloud storage device, some video clips may come from aseparate device, some video clips may come from the user terminal). Insome embodiments, the user terminal may receive video clips and/or thevideo from a single source or from multiple sources. Similarly, theimage capture device and/or movable object may create the merged videousing video clips and/or video from a single source (e.g., memory of theimage capture device and/or movable object) or multiple sources.

In some embodiments, if the user wishes to merge the same prologue,transition effect and/or epilogue, the configuration on prologue,transition effect and/or epilogue may be saved, for example, as atemplate. The configuration on prologue, transition effect and epiloguemay be saved by pressing a pressing a physical button or a soft buttonon the user terminal such as a remote controller or a smart phone.Alternatively, the configuration may be saved by pressing a pressing aphysical button or a soft button on the image capture device such as aUAV carrying a camera unit.

FIG. 14 is a schematic illustration by way of block diagram of a systemfor processing a video, in accordance with embodiments of the presentdisclosure.

The system for processing a video in accordance with embodiments of thepresent disclosure as shown in FIG. 14 may comprise a user terminal3-602 and an image capture device 3-604. Any descriptions herein ofcomponents of the image capture device may include components on-boardthe image capture device itself (e.g., camera). Alternatively, one ormore components may be distributed over a movable object (e.g., UAV)supporting the image capture device.

In some embodiments, the image capture device 3-604 may comprise acontrol module 3-606, a video capture module 3-608, a video processingmodule 3-610 and a video storing module 3-612 which are seriallyconnected. The control module 3-606 may be directly connected to thevideo processing module 3-610. The image capture device 3-604 may alsocomprise a memory device 3-614 which is connected to the videoprocessing module 3-610 and the video storing module 3-612. In someembodiments, the memory device 3-614 may comprise a prologue library3-616, a transition effect library 3-618 and an epilogue library 3-620.The memory device may receive video clips for the libraries from a videostoring module. For instance, one or more of the video clips may becaptured using the video capture module. In other instances, the videoclips may be provided to the memory device from an external source. Insome instances, the memory device will download the data and/or videoclips for libraries. The downloads may occur periodically or in responseto an event. In some examples, the downloads may occur any time theimage capture device is turned on. In other instances, the downloads mayoccur at regular or irregular intervals of time. Optionally, thedownloads may occur in response to a user request to update the memory.Thus, the libraries may be stored in a memory on-board the image capturedevice. Alternatively, the libraries may be stored off-board the imagecapture device and may be accessed by the image capture device. Forinstance, the libraries may be stored on the cloud and may be accessedby the image capture device upon request. The image capture device maybuffer or temporarily store the selected video clips or portions of thelibraries.

In some embodiments, the user terminal 3-602 may be a remote controlleror a smartphone. In some instances, at least one physical button may beprovided on the user terminal. Alternatively, at least one screen, suchas a LCD (Liquid Crystal Display) screen, may be provided on the userterminal. A display of the user terminal may be a touchscreen. The usermay input an instruction by pressing the physical button or soft button.The user may input instructions via a touchscreen (e.g., touching a softkey, tapping, swiping, pinching, expanding, twisting/turning clockwiseor counter clockwise). In some instances, the user terminal 3-602 maycomprise at least a communication unit that permits communication withthe image capture device, and one or more processors that individuallyor collectively to perform at least the operation as discussedhereinabove with reference to FIG. 9 and FIG. 11.

In some embodiments, the image capture device 3-604 may be a camera, aUAV carrying a camera unit, or a mobile phone having a camera module. Insome instances, the image capture device may comprising at least animage sensor configured to capture a video, a communication unit (notshown in FIG. 14) that permits communication with the user terminal, andone or more processors that individually or collectively to perform atleast the operation as discussed hereinabove with reference to FIG. 12and FIG. 13.

In some embodiments, the user terminal 3-602 may browse a plurality ofexisting video clips that are locally stored in, for example the memorydevice 3-614, and select an existing video clip as a video clipappropriate for a prologue, a transition effect or an epilogue. The userselected video clip may be accordingly stored in the prologue library3-616, the transition effect library 3-618 or the epilogue library3-620. Optionally, the user may edit an existing video clip by sendingan instruction to the video processing module 3-610 through the controlmodule 3-606. Optionally, the user may edit a captured video clip bysending an instruction to the video processing module 3-610 through thecontrol module 3-606. The edited video clips may then be accordinglystored in the prologue library 3-616, the transition effect library3-618 or the epilogue library 3-620. Alternatively, the user mayinstruct the video capture module 3-608 to capture a video clip ofcurrent scene, through the control module 3-606, and store the capturedvideo clip to be used as a prologue, a transition effect or an epiloguein one of the prologue library 3-616, the transition effect library3-618 and the epilogue library 3-620.

In some embodiments, the control module 3-606 may receive instructionsfrom the user terminal 3-602, and send the instructions to the videocapture module 3-608 and the video processing module 3-610.

In some embodiments, the video capture module 3-608 may capture a videothat is to be processed, and feed the captured video to the videoprocessing module 3-610. In some instances, the video capture module3-608 may also capture a video clip that is to be used as a prologue, atransition effect or an epilogue, according to an instruction from theuser terminal 3-602.

In some embodiments, the video processing module 3-610 may merge atleast one of a prologue, a transition effect and an epilogue to a videocaptured by the video capture module 3-608. Optionally, the videoprocessing module 3-610 may edit an existing video clip according toinstructions from the user terminal 3-602, such that the edited videoclip may be used as a prologue, a transition effect or an epilogue.Optionally, the video processing module 3-610 may edit video clipscaptured by the video capture module 3-608 such that the edited videoclip may be used as a prologue, a transition effect or an epilogue.

In some embodiments, the video storing module 3-612 may store the mergedvideo, which is processed by the video processing module 3-610, to thememory device 3-614. Optionally, the video storing module 3-612 maystore the edited video clips, which are processed by the videoprocessing module 3-610, to one of the prologue library 3-616, thetransition effect library 3-618 and the epilogue library 3-620.

In some embodiments, the memory device 3-614 may include at least theprologue library 3-616, the transition effect library 3-618 and theepilogue library 3-620 which are used to store the video clips suitablefor a prologue, a transition effect or an epilogue. Optionally, thememory device 3-614 may include a memory for storing the merged video.

FIG. 15 is a flow chart illustrating a method of processing a video, inaccordance with an embodiment of the disclosure.

In step 3-702, a video may be captured at the image capture device. Theoperations in step 3-702 may be identical to those performed in step3-402 as discussed hereinabove with reference to FIG. 12. Similarly, theorder of the steps may be modified. For instance, step 3-702 may occurafter step 3-704 or 3-706.

In step 3-704, a determination may be made at the user terminal onwhether the video captured by the image capture device is to beprocessed. The user may provide an input to a user terminal thatindicates the video is to be processed. For example, the user may inputsignal indicative the determination by pressing a physical button or asoft button provided on the user terminal. A video edit request or anytype of indication of video editing may be an indication that the videois to be processed. The operations in step 3-704 may optionally beidentical to those performed in step 3-102 as discussed hereinabove withreference to FIG. 9.

If the determination in step 3-704 is YES, that is, the captured videois determined to be processed, then in step 3-706, one or more videoclips to be merged to the video captured by the image capture device maybe selected at the user terminal, thereby forming a video edit request.The operations in step 3-706 may be identical to those performed in step3-104 as discussed hereinabove with reference to FIG. 9. Otherwise, ifthe determination in step 3-704 is NO, that is, the captured video isdetermined not to be processed, then the processing flow may return backto step 3-702.

In step 3-708, the video edit request may be provided from the userterminal to the image capture device. The operations in step 3-708 maybe identical to those performed in step 3-106 as discussed hereinabovewith reference to FIG. 9 and those performed in step 3-404 as discussedhereinabove with reference to FIG. 12.

In step 3-710, the one or more video clips may be merged to the capturedvideo at the image capture device. The operations in step 3-710 may beidentical to those performed in step 3-406 as discussed hereinabove withreference to FIG. 12.

In some embodiments, the method of processing a video as shown in FIG.15 may further comprise transmitting the merged video to the userterminal at a definition which may be equal to, lower than, or higherthan a definition in capturing the video by the image capture device.

Optionally, at a user terminal, the user may view the video and/or videoclips to be merged. A video edit request may be formed at the userterminal indicative of how the video and/or video clips are to bemerged. When viewing the video and/or video clips, the user may viewthem at a resolution that may be less than or equal to a resolution atwhich the video and/or video clips are captured and/or created. Thevideo edit request may be provided to an image capture device. The imagecapture device, in response to the video edit request, may return amerged video. The merged video may be at a resolution that may begreater than, equal to, or less than the resolution at which the videoand/or video clips were viewed at the user terminal. The merged videomay be at a resolution that may be greater than, equal to, or less thanthe resolution at which the video and/or video clips were capturedand/or created.

FIG. 16 is diagram illustrating a configuration of a merged videoproduced by a method of processing a video, in accordance with anembodiment of the disclosure.

In some embodiments, the merged video produced by a method of processinga video of the disclosure may comprise a video captured by an imagecapture device and at least one of a prologue, a transition effect andan epilogue. In some instances, the number of transition effect may bemore than one. In some instances, the merged video may not comprise aprologue. Alternatively, the merged video may not comprise a transitioneffect. Optionally, the merged video may not comprise an epilogue.

For instance, the merged video shown in FIG. 16 may comprise a video3-804 captured by the image capture device such as a UAV, one prologue3-802, one epilogue 3-808 and a plurality of transition effects 3-806.The prologue 3-802 may be added at the beginning of the captured video3-804. The epilogue 3-808 may be added at the end of the captured video3-804. The plurality of transition effects 3-806 may be inserted in amiddle of the captured video 3-804.

In some embodiments, one or more prologues may be appended to a video.For instance, multiple prologue clips can be merged at the front of avideo. Similarly, one or more epilogues may be appended to a video.Multiple epilogue clips can be merged at the end of a video. One or moretransition effects may be inserted into a video. Multiple transitioneffects may be inserted into the video adjacent to one another, or maybe spaced apart within the video.

In other embodiments, various video editing techniques may also includesplicing videos side by side so that they may simultaneously play. Thespliced videos may be adjacent to one another or form an array. In someinstances, the spliced videos may simultaneously play in apicture-in-picture format, where at least one of the images is withinanother of the images. Any description herein of merging videos, e.g.,for prologue, transition effect, and/or epilogue may also apply to othervideo editing that may occur.

FIG. 17 is a flowchart showing a video processing method according to anembodiment of present disclosure. The method described in thisdisclosure may be implemented by electronic devices such as UAVs,cameras, remote controllers, mobile phones, tablets, computers, orlaptops that can play video.

At step S101, an edit operation associated with a target video isreceived. As provided in step S101, the edit operation may identifyselected video clips. For example, a target video may be played to auser. The target video being played may be a video being recorded, avideo being previewed, or a video being played back. For instance, thetarget video may be created by being captured by an image capturedevice, such as those described elsewhere herein. The target video maybe captured concurrently with playing the video to the user, or prior toplaying the video to the user. The preview and playback may be performedon a video capture device such as a camera, or on may be played on aseparate device such as a mobile device (e.g., smartphone, tablet), orcomputer (e.g., laptop computer, desktop computer). The devicesdescribed herein may refer to video editing devices. Alternatively, thetarget video may be a pre-stored video that may originate from the imagecapture device or any other source.

During the playing the video, a user may interact with the video toperform an edit operation, such as selecting a video clip. The editoperation may be a single click, a double click or a long press. In someinstances, the video clip may be a part of the target video. Forinstance, a beginning point and/or end point of the video clip may beselected at two points in time of the target video. The beginning pointand/or end point may be identified by clicking, double clicking, longpress, dragging, drag-and-drop or any other user interaction with thevideo. Alternatively, the video clip may be provided from a separatevideo file as the target video. The video clip may be the entirety of aseparate video file, or a portion of the separate video file. Once avideo clip has been selected, the edit operation for the target videomay be deteced and received. If the edit operation is received, videoclips may be identified from the target video according to a ruleassociated with the edit operation.

An implementation of the rules for the edit operations may include anexample where the edit operation is a single click, and a video cliphaving a certain duration may be identified by extending a predeterminedduration in both time directions centering the time point of the singleclick. In some embodiments, a video clip having a duration of 5 s may beidentified by extending 2 s in a starting direction of the target videoand extending 3 s in an ending direction of the target video. In otherembodiments, the extended time may be set to other values. For example,a video clip having a duration of 10 s may be identified by extending 5s in a starting direction of the target video and extending 5 s in anending direction of the target video. The duration of a video clip maybe pre-set, or may be determined by a user. Similar examples areprovided in a discussion of user video edits as provided in FIG. 6above.

In another example, the edit operation may include a double click, and avideo clip may be identified as having a duration between the timepoints of the two clicks. In other examples, the edit operation mayinclude a long press, and a video clip may be identified from the targetvideo as having a duration between the starting time and ending time ofthe long press. In some embodiments, in order to compensate a delay in auser's press with respect to a playing of the video, the starting timeof the long press may be determined by advancing a couple of seconds,such that the identified video clip is a complete one. In someinstances, when a functional button is pressed, the starting time of thepress may be determined as 2 s before the press. In other embodiments, astarting time of the press may be determined as another time periodbefore the press. Alternatively, a starting time of the press may bedetermined as certain time period after the press. Additional example ofuser selections and interactions are discussed in FIG. 6 above.

At step S102 of FIG. 17, attributes of selected video clips arerecorded. The recorded attributes may be utilized in editing the targetvideo or editing video that corresponds to the target video. Inparticular, in step S102, the attributes of the video clips which areselected in S101 may be recorded. The attributes of the video clips maybe recorded in memory on-board a video-editing device. Alternatively,the attributes of the video clips may be recorded in memory on-board avideo clip origination device (e.g., an image capture device if theimage capture device is the source of the video clip).

In some embodiments, the attributes may include a starting time andending time of the video clip in the target video, and the time of beingselected from the target video. In some embodiments, the attributes mayinclude description such as a format, a duration, a memory usage,location of capture, route of capture, a file name of the target videofrom which the video clip is selected, definition and frame rate.

The attributes of the video clip may be used for editing the targetvideo. In some embodiments, when an instruction for video synthesizingis received, all video clips having a certain attribute may be selectedfrom the target video, and the selected video clips may be synthesizedto a video file. In particular, video clips that share a particularattribute may be selected and synthesized. For example, video clips thathave a particular user mark, such as a star rating, may be selected andsynthesized. User marks may include tags that signify a particularrating, such as 3 stars for the best favorite, 2 stars for the betterfavorite, and 1 star for a good favorite. In some instances, video clipsshot at or near a particular location may be selected and synthesized.Such selection may be performed with aid of one or more processors,without requiring a user to manually scan through multiple video clipsto find the video clips of particular attributes. A user may select thetype of attributed to be selected. In some instances, the video clipsmay be sorted in a sequence of time in the target video and thensynthesized. Alternatively, the video clips may be sorted in a sequenceof a time being selected and then synthesized. The rule for synthesizingmay be determined in view of the user's demand.

For devices that are remote to the image capture device (e.g., an aerialshooting apparatus), the video clips may be identified from a bufferedvideo. In this case, the attributes of the video clips may be recorded,and corresponding video clips may be downloaded from a video stored in astorage media onboard an aerial vehicle (e.g., UAV), such as a memorycard, according to the recorded attributes of the video clips. Thememory card may be on-board an image capture device supported by theUAV, or may be on-board the UAV without being on-board the image capturedevice. In some embodiments the memory card may be on-board an imagecapture device that may encompass the UAV. Any description herein ofcomponents on-board the aerial vehicle and/or image device may refer toany combination as described. In some embodiments, if the video storedin a storage media onboard the aerial vehicle has a higher definitionand the locally buffered video has a lower definition, the attributes ofthe video clips may be determined from the video having a lowerdefinition, and then the corresponding video clips may be laterextracted from the video having a higher definition.

When video clips are extracted from the target video, a definition ofthe video clips may be modified. In some embodiments, the definition ofvideo clips may be lowered in order to save space in storage.Alternatively, the definition of video clips may be increased bytechniques such as interpolation.

In some embodiments, when an edit instruction for editing the recordedattributes is received, the attributes of the video clips may be editedaccording to the edit instruction. For example, the edit instruction mayinclude one or more directions to modify the recorded attribute, such asadding new attributes and deleting attributes. Alternatively, portionsof attributes, such as a selected start time of a video clip, may beadded, deleted, or modified.

If the edit instruction is modifying the recorded attribute, thestarting time and/or ending time in the attributes of the video clip maybe modified such that the duration of the video clip may be modified, orthe position of the video clip itself within the video may be shiftedforward or backward. In some embodiments, other attributes such aslocation of capture, format, or definition may be modified.

In an example where the edit instruction is adding new attributes, thetarget video may be played again and an edit operation may be receivedagain. Additional video clip or video clips may be identified by theedit operation, and the attributes of the new video clips may berecorded. In another example where the edit instruction is deletingattributes, attributes of the video clips which are identified by thedeleting instruction may be deleted.

A user may interact with a device, such as a user terminal or otherdevices described herein, to edit the video. In some embodiments, one ormore sessions may be provided during which a user may interact withvideo clips to edit the video. A user may log off between sessions orshut down an editing application between sessions. In some instances,editing may occur within a single session. Alternatively, editing mayspan multiple sessions and a user may be able to save the user'sprogress between sessions. A user may be within an editing phase whilethe user is editing the video. One or more edit instructions may beprovided during the editing phase. The user may play back edited videoduring the editing phase. In some instances, the user may go back andforth between playing back video and making additional edit instructionsduring the editing phase until the user feels the editing is complete.During the editing phase, the video may be local buffered video. Duringthe editing phase, the video may be at a definition lower than or equalto the video as was originally captured or created. When a user hascompleted editing the video, a video synthesizing instruction may beprovided. The video synthesizing instruction may optionally be a videoedit request. In some embodiments, the video clips to form the finalizedvideo may be extracted from a source (e.g., image capture device, aerialvehicle). This may occur during a synthesization phase. The finalizedvideo may be delivered and/or played back. The finalized video may be ata definition greater than, equal to, or lower than the definition atwhich the video was originally captured or created. The finalized videomay be at a definition greater than or equal to the definition of thevideo used during the editing phase.

Additionally, when a video synthesizing instruction is received afterediting the attributes, video clips may be extracted from the targetvideo according to the edited attributes, and a video file may beproduced. In some embodiments, the extracted video clips may be storedin a storage media in certain format, such as MP4 (Moving PictureExperts Group Audio Layer IV), AVI (Audio Video Interleave) or MOV(Quicktime Movie Format). The user may play or preview each of the videoclips, or the user may play or preview a combined file of the videoclips. In some embodiments, the stored video clips may be selectedduring one playing process. Alternatively, the stored video clips may beselected during more than one playing processes. Therefore, a storingtime of the video clips may be different from a time of the video clipsin the target video. In some embodiments, the video clips may becombined based on the order in which the video clips are stored.Alternatively, the video clips may be combined based on the order inwhich the video clips are presented in the target video. Further, theprocess of selecting video clips from the target video, extracting videoclips according to attributes of video clips, and synthesizing the videoclips into a video file may be implemented by dedicated software.

FIG. 18 is a flowchart showing a video processing method according toanother embodiment of present disclosure. As shown in FIG. 18, editoperation relating to target video being played is received at stepS201. In some examples, the target video being played may be a videobeing recorded, a video being previewed, or a video being played back.Additionally, a user may interact with the target video to edit thevideo. Accordingly, at step S202, video clips to be selected by the editoperation are identified. In particular, the video clips to be selectedmay be identified by the edit operation according to a rule associatedwith the edit operation. The edit operation may include any userinteraction with a device, such as a single click, a double click, along press, a swipe or dragging motion, or a drag-and-drop, on aphysical button or functional button of an application. The editoperation may occur with any other type of user interaction via a userinteractive device, such as a mouse, joystick, trackball, touchpad,touchscreen, multi-touch, tilt or shaking of the device, camera, ormicrophone. Any description herein of user interaction of a device mayinclude any of the interactions described herein.

If the edit operation is a single click, a video clip having certainduration may be identified by extending a predetermined duration in bothtime directions centering the time point of the single click.Alternatively, if the edit operation is a double click, a video clip maybe identified as having a duration between the time points of the twoclicks. Further, if the edit operation is a long press, a video clip maybe identified from the target video as having a duration between thestarting time and ending time of the long press. The video clip and/orduration of the video clip may be identified using any other userinteraction.

In step S203, attributes of the video clips are acquired. In someembodiments, the attributes of the video clips may be acquired after thevideo clips are identified from the target video. Additionally, theidentified video clips may be stored.

In step S204, the attributes may be recorded in a file. For example, theattributes of the video clips may be recorded in a file. The name of thefile may be associated with the target video, such that the user mayfind the target video based on a file name of the attributes. In someinstances, the file may be independent from the target video.

An instruction of video synthesizing may be received at step S205. Forexample, an instruction of video synthesizing may be received, and inresponse the video clips identified by the attributes in the file may beextracted from the target video. Additionally, before extracting fromthe target video the video clips identified by the attributes, theattributes may be edited on user's demand. For example, attributes ofcertain video clips may be deleted from the file, attributes of certainvideo clips may be modified, or attributes of new video clips may beadded to the file. In some instances, the attributes may be edited morethan once. Additionally, if the file containing edited attributes ofvideo clips satisfies the user's demand, video clips may be extractedfrom the target video according to the edited attributes in the file.

Optionally, the target video may be stored at a memory on a device thatis remote to a device where an edit operation is received. For example,a user may edit a video at a user terminal and the target video may bestored on an image capture device or on an aerial vehicle (e.g., on theimage capture device on the aerial vehicle, or on the aerial vehiclewhile not being stored on the image capture device). Alternatively, thetarget video stored may be stored at a memory on the same device wherean edit operation is received.

At step S206, the selected video clips may be synthesized into a videofile. When synthesizing the video file, the video clips may be sorted ina sequence of a time in the target video. Alternatively, the video clipsmay be sorted in a sequence of a time being selected. Subsequently, thesorted video clips may be combined into a video file. In someembodiments, the video file may be played directly after beingsynthesized. Alternatively, the video file may be exported to a storagemedia such as a personal computer, a mobile phone, a pad or a mobilehard disk for later play.

The video synthesization may occur on a device having the memory onwhich the target video is stored. Thus, the video synthesization mayoccur locally where the video is stored. For example, a user may edit avideo on a user terminal and the target video may be stored andsynthesized on an image capture device or on an aerial vehicle. Theediting phase may occur on a first device and the synthesization phasemay occur on a second device. The synthesized video may be played backon the image capture device or on the aerial vehicle. The synthesizedvideo may be transmitted to another device, such as the user terminal orany other device to be played back. The synthesized video may betransmitted to a cloud storage or a video streaming site. Alternatively,the video synthesization may occur on a device separate from the deviceon which the target video is originally stored. The target video may betransmitted to another device where the synthesization may occur. Theother device where the synthesization may occur may be the device wherethe editing occurs or may be a separate device. For example, a user mayedit a video on a user terminal the target video may be stored on animage capture device or an aerial vehicle. The target video may betransmitted to the user terminal or a separate device wheresynthesization may occur. The editing phase may occur on a first deviceand the synthesization phase may occur on the same first device, oralternatively on a third device.

In an example where the user opens a new target video and selects videoclips from the new target video, the above procedures may be performedto identify video clips from the new video clip and record theattributes of the video clips in a new file. Further, since theattributes of the video clips are recorded in a preset file, the videoclips may be extracted from the target video according to the attributesrecorded in the file, even if the user fails to store the identifiedvideo clips.

FIG. 19 is a flowchart showing a video processing method according tostill another embodiment of present disclosure. At step S301, editoperations for the target video being played are received. Further, atstep S302, video clips to be selected by the edit operation areidentified according to a rule associated with the edit operation. Atstep S303, attributes of the video clips are acquired. Additionally, atstep S304, the attributes are embedded into the target video. Forexample, a starting time and an ending time of the video clips may betagged on the target video, and the video clips may be assigned a nameor brief description. Alternatively, the time of being identified forthe video clips may be tagged on the target video. The attributes of thevideo clips may be stored together with the target video. The user mayopen the target video, click the identified video clips and preview thevideo clips.

At step S305, instruction of video synthesizing may be received. Forexample, if an instruction of video synthesizing is received, the videoclips identified by the attributes in the file may be extracted from thetarget video. Additionally, if an instruction of video synthesizing isreceived, the target video may be opened, the identified video clips maybe extracted and the video clips may be stored. Further, the selectedvideo clips may be synthesized into a video file at S306. In someinstances, before combining the extracted video clips, the video clipsmay be sorted in a sequence of a time in the target video or in asequence of a time being selected. The sorted video clips may then besynthesized to produce a video file.

Since the attributes of the selected video clips in FIG. 19 are embeddedinto the target video, the user may know the attributes of the videoclips when the target video is played. In some embodiments, the user canedit the attributes of the video clips in real time when the targetvideo is played.

FIG. 20 is a schematic view showing a configuration of a videoprocessing device according to an embodiment of the disclosure. Thevideo processing device may be installed on electronic device which canplay a video such as an aerial vehicle, a mobile phone, an electronicpad, a camera, etc. The video processing device may be a device or maybe installed on a device on which a video editing application may berun. A user may interact with the video processing device to edit avideo. For instance, the video processing device may be installed on auser terminal, which may run a video editing application through whichthe user may edit video.

In some embodiments, the video processing device may include a receivingmodule 4-1 for receiving an edit operation for the target video beingplayed, and for identifying the video clips to be selected by the editoperation; and a recording module 4-2 for recording the attributes ofthe selected video clips such that the target video may be edited.

During the playing of the target video, the user's edit operation of thetarget video may be received, such as user's single click, a doubleclick or a long press. For example, video clips may be identified fromthe target video according to a rule associated with the edit operation.Additionally, the recording module 4-2 may record the attributes of thevideo clips which are selected by the receiving module 4-1 such that thetarget video may be edited according to the stored attributes.

FIG. 21 is a schematic view showing a configuration of a videoprocessing device according to another embodiment of the disclosure. Insome embodiments, the video processing device may be used to edit andsynthesize video. Optionally, an editing phase and synthesizing phasemay occur on the video processing device. The video processing devicemay further include a playing module 4-3, an editing module 4-4, asynthesizing module 4-5 and a processing module 4-6.

The playing module 4-3 may play a video, such as the target video. Insome embodiments, the video played by the playing module 4-3 may be avideo being recorded, a video being previewed, or a video being playedback. Alternatively, the video played by the playing module 4-3 may beselected video clips or the final synthesized video file.

The editing module 4-4 may edit the recorded attributes of the videoclips, including modifying and deleting the attributes of certain videoclips and adding attributes of new video clips. Further, thesynthesizing module 4-5 may synthesize and produce the video file. Theprocessing module 4-6 may perform processings to the video clips.

FIG. 22 is a schematic view showing a configuration of a receivingmodule in the video processing device according to the embodiment shownin FIG. 20 or FIG. 21. In some embodiments, the receiving module mayinclude an edit receiving unit 4-11 and a video tagging unit 4-12. Theedit receiving unit 4-11 may receive edit operation for the target videobeing played, including any of the user interactions described elsewhereherein, such as a single click, a double click or a long press. Thevideo tagging unit 4-12 may identify video clips from the target videobeing played according to a rule associated with the edit operation.

If the edit operation received by the edit receiving unit 4-11 is asingle click, the edit receiving unit 4-11 may identify a video cliphaving certain duration by extending a predetermined duration in bothtime directions centering the time point of the single click. In someembodiments, a video clip having a duration of 5 s may be identified byextending 2 s in a starting direction of the target video and extending3 s in an ending direction of the target video. In some embodiments, theextended time may be set to other values. For example, a video cliphaving a duration of 10 s may be identified by extending 5 s in astarting direction of the target video and extending 5 s in an endingdirection of the target video.

If the edit operation received by the edit receiving unit 4-11 is adouble click, the edit receiving unit 4-11 may identify a video ashaving a duration between the time points of the two clicks.

If the edit operation received by the edit receiving unit 4-11 is a longpress, the edit receiving unit 4-11 may identify a video clip from thetarget video as having a duration between the starting time and endingtime of the long press. In some embodiments, in order to compensate adelay in user's press with respect to a playing of the video, thestarting time of the long press may be determined by advancing a coupleof seconds, such that the identified video clip is a complete one.

In some instances, when a functional button is pressed, the startingtime of the press may be determined as 2 s before the press. In someembodiments, a starting time of the press may be determined as othertime period before the press. Alternatively, a starting time of thepress may be determined as certain time period after the press.

FIG. 23 is a schematic view showing a configuration of the recordingmodule in the video processing device according to the embodiment shownin FIG. 20 or FIG. 21. In some embodiments, the recording module 4-2 mayinclude an attribute acquiring unit 4-21, a first attribute storage unit4-22 and a second attribute storage unit 4-23.

The attribute acquiring unit 4-21 may acquire the attributes of thevideo clips. In some embodiments, the attribute may include a startingtime and ending time of the video clip in the target video, and the timeof being selected. In some embodiments, the attribute may includedescription such as a format, a duration, a memory usage, location ofcapture, route of capture, a file name of the target video from whichthe video clip is selected, definition and frame rate.

The first attribute storage unit 4-22 may record the attributes of thevideo clips acquired by the attribute acquiring unit 4-21 in a presetfile. In some embodiments, when recording the attributes of the videoclips in the file, the name of the file may be associated with thetarget video, such that the user may find the target video based on afile name of the attributes. Alternatively, the file may be assignedanother name. In some instances, the file may be independent from thetarget video.

The second attribute storage unit 4-23 may embed the attributes of thevideo clips acquired by the attribute acquiring unit 4-21 into thetarget video. In some embodiments, a starting time and an ending time ofthe video clips may be tagged on the target video, and the video clipsmay be assigned a name or brief description. Alternatively, the time ofbeing identified for the video clips may be tagged on the target video.The attributes of the video clips may be stored together with the targetvideo. The user may open the target video, click the identified videoclips and preview the video clips.

FIG. 24 is a schematic view showing a configuration of the editingmodule in the video processing device according to the embodiment shownin FIG. 21. The editing module 4-4 may edit the recorded attributes ofthe video clips. In some embodiments, the editing module 4-4 may includean instruction receiving unit 4-41 and an editing unit 4-42. Thereceiving unit 4-41 may receive an edit instruction for editing therecorded attributes. When the receiving unit 4-41 receives an editinstruction for editing the recorded attributes, the editing unit 4-42may edit the recorded attributes of the video clips according to thereceived edit instruction. For example, the attribute editinginstruction may include one or more of modifying the recorded attribute,adding new attributes and deleting attributes.

If the receiving unit 4-41 receives a modifying instruction, one or moreof the starting time, ending time in the attributes of the video clipmay be modified. Alternatively, if the receiving unit 4-41 receives adeleting instruction, attributes of the video clips which are identifiedby the deleting instruction may be deleted. Alternatively, if thereceiving unit 4-41 receives an adding instruction, the target video maybe played again and an edit operation may be received again. Additionalvideo clip or video clips may be identified by the edit operation, andthe attributes of the new video clips may be recorded.

FIG. 25 is a schematic view showing a configuration of the synthesizingmodule in the video processing device according to the embodiment shownin FIG. 21. The synthesizing module 4-5 may include a control unit 4-51and a synthesizing unit 4-52. The synthesizing module 4-5 may edit thevideo according to the attributes of the video clips as recorded by therecording module 4-2.

The control unit 4-51 may extract video clips identified by theattributes from the target video when an instruction of videosynthesizing is received. The synthesizing unit 4-52 may combine thevideo clips extracted by the control unit 4-51 such that a video file isproduced. When synthesizing the video file, the video clips may besorted in a sequence of a time in the target video. Alternatively, thevideo clips may be sorted in a sequence of a time being selected. Insome embodiments, the video file may be played directly after beingsynthesized. Alternatively, the video file may be exported to a storagemedia such as a personal computer, a mobile phone, a tablet or a mobilehard disk for later play.

FIG. 26 is a schematic view showing a configuration of the processingmodule in the video processing device according to the embodiment shownin FIG. 21. The processing module 4-6 may combine the video clips. Insome embodiments, if the recording module 4-2 stores the video clipstagged by the receiving module 4-1, the processing module 4-6 maycombine the stored video clips. The processing module 4-6 may include avideo acquiring unit 4-61 and a video processing unit 4-62. The videoacquiring unit 4-61 may acquire the identified video clips, and thevideo processing unit 4-62 may synthesize the video clips. In combining,the video clips may combine in order to produce a video file. In someembodiments, the video clips may be sorted in a sequence of a time inthe target video or in a sequence of a time being selected, then thesorted video clips may be synthesized to produce a video file.

In the present embodiment, the video clips may be identified from the target video, and the attributes of the video clips may be recorded. The user may synthesize the video file based on the recorded attributes of the video clips, and may edit the video file by editing the attributes of the video clips. Therefore, no specific video editing software is necessary. In some embodiments, the video clips may be tagged during video capture, so no playback is needed to identify the video clips.

In some embodiments, user input to edit videos may occur with videos of lower resolution. Once the user has completed editing the videos, the attributes and edit instructions may be used to acquire video from a target video which may be at an equal or higher resolution than the videos on which editing occurred. The video acquisition and video processing to produce the synthesized video file may occur at the equal or higher resolution.

FIG. 27 is a schematic view showing a configuration of a playing device according to an embodiment of the present disclosure. In some embodiments, the playing device may be an electronic device that can play and display a video, such as a UAV, a camera, a mobile phone, or a tablet. The playing device may include a player 4-100 and a processor 4-200. The player 4-100 may play a video. The processor 4-200 may receive an edit operation for the target video being played, identify video clips to be selected by the edit operation, and record the attributes of the selected video clips, such that the target video can be edited.

In some embodiments, the player 4-100 may play and display a target video. In some instances, the target video may be a video being recorded, a video being previewed, or a video being played back. Alternatively, the player 4-100 may play the video clips which are identified from the target video, or the final synthesized video file. In some examples, the processor 4-200 may receive an edit operation for the target video being played on the player 4-100, identify video clips to be selected by the edit operation according to a rule associated with the edit operation, and record the attributes of the selected video clips. In other examples, the processor 4-200 may receive a single click, a double click, or a long press on the target video being played on the player 4-100, and identify video clips from the target video according to a rule associated with the edit operation.

If the received edit operation is a single click, a video clip having a certain duration may be identified from the target video by extending a predetermined duration in both time directions, centered on the time point of the single click. Alternatively, if the received edit operation is a double click, a video clip may be identified from the target video as having a duration between the time points of the two clicks. If the received edit operation is a long press, a video clip may be identified from the target video as having a duration between the starting time and ending time of the long press.

In some embodiments, the processor 4-200 may acquire the attributes of the video clips, and record the attributes of the video clips in a preset file or embed the attributes of the video clips into the target video. In some embodiments, the attributes may include a starting time and ending time of the video clip in the target video, and the time of being selected. In some embodiments, the attributes may include a description such as a location of capture, route of capture, and definition.
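
For illustration, one possible way to record such attributes in a preset (sidecar) file is sketched below in Python; the field names and the JSON layout are assumptions, not the format actually used by the playing device.

    # Illustrative sketch of recording clip attributes in a preset file;
    # field names and values are placeholders.
    import json

    attributes = {
        "start": 12.0,                       # starting time in the target video (s)
        "end": 17.0,                         # ending time in the target video (s)
        "selected_at": "2015-03-31T10:02:11Z",
        "location": "beach",                 # optional description of the capture location
        "route": "flight-route-3",           # optional route of capture
        "definition": "1080p",
    }

    with open("clip_0001.json", "w") as f:   # hypothetical sidecar file name
        json.dump(attributes, f, indent=2)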

In some embodiments, if the processor 4-200 receives an edit instruction for editing the recorded attributes, the processor 4-200 may edit the attributes of the video clips according to the edit instruction. For instance, the edit instruction may include one or more of modifying the recorded attributes, adding new attributes, and deleting attributes.

If the edit instruction is modifying the recorded attributes, the processor 4-200 may modify one or more of the starting time and/or ending time in the attributes of the video clip. Additionally, if the edit instruction is deleting attributes, the processor 4-200 may delete the attributes of the video clips which are identified by the deleting instruction. Further, if the edit instruction is adding new attributes, the target video may be played again and an edit operation may be received again. An additional video clip or video clips may be identified by the edit operation, and the attributes of the new video clips may be recorded.

In some embodiments, if an instruction of video synthesizing is received, the processor 4-200 may extract the video clips identified by the attributes from the target video, and synthesize the selected video clips into a video file. In some embodiments, when synthesizing the video file, the video clips may be sorted in a sequence of a time in the target video and then synthesized. Alternatively, the video clips may be sorted in a sequence of a time being selected and then synthesized.

In some embodiments, the processor 4-200 may store the video clips identified from the target video. Therefore, if an instruction of video synthesizing is received, the processor 4-200 may acquire the stored video clips and synthesize a video file in a successive order. In some instances, the video clips may be sorted in a sequence of a time in the target video and then synthesized. Alternatively, the video clips may be sorted in a sequence of a time being selected and then synthesized.

FIG. 28 is a flowchart showing a video processing method according to an embodiment of the disclosure. The method described in this disclosure may be implemented by an electronic device such as an aerial vehicle, a camera, a mobile phone, or a tablet that can play video.

At step S2-101 of FIG. 28, an edit rule that is configured by a user is received. In particular, the user may select a preset edit rule. Alternatively, the user may customize the edit rule according to a demand. In some examples, before receiving the edit rule configured by the user, video data may be received from an image capture device and/or an aerial vehicle (e.g., UAV) as a video source such that video clips may be acquired. The video data captured by the image capture device and/or aerial vehicle may be locally buffered and used as target video from which video clips may be extracted. Video editing may occur on the device that captures the video data (e.g., image capture device and/or aerial vehicle) or on a separate device (e.g., user terminal).

In some examples, before receiving the edit rule configured by the user, the following operations may be performed: receiving a tagging operation for the target video being played; extracting video clips being selected by the tagging operation; generating tags for the selected video clips; and storing the video clips in association with the generated tags in a video clip library. Alternatively, video clips may be extracted from the target video, and a description of each video clip may be stored, including a starting time, an ending time of the video clip in the target video, and a definition of the video clip. Then, the video clips may be extracted and downloaded from a video stored in a storage medium of a video source device (e.g., onboard the image capture device and/or aerial vehicle), such as a memory card, according to the recorded description of the video clips.

When video clips are extracted from the target video, a definition of the video clips may be modified. In particular, the definition of video clips may be lowered in order to save storage space. Alternatively, the definition of video clips may be increased by techniques such as interpolation.
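
As an illustrative sketch only, a clip could be extracted and its definition lowered with a general-purpose tool such as ffmpeg invoked from Python; the paths, times, and target size below are placeholders, and the disclosure does not depend on this particular tool.

    # Illustrative only: extract a clip and lower its definition with ffmpeg's
    # scale filter. Paths, times, and the target size are placeholders.
    import subprocess

    def extract_clip(src, dst, start, end, width=1280, height=720):
        subprocess.run([
            "ffmpeg", "-y", "-i", src,
            "-ss", str(start), "-to", str(end),          # clip boundaries in seconds
            "-vf", f"scale={width}:{height}",            # lower the definition to save storage
            dst,
        ], check=True)

    extract_clip("target_video.mp4", "clip_720p.mp4", start=12.0, end=17.0)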

Additionally, the edit operation to the target video being played may be a single click, a double click, or a long press. If the edit operation is a single click, a video clip having a certain duration may be identified by extending a predetermined duration in both time directions, centered on the time point of the single click. In some embodiments, a video clip having a duration of 5 s may be identified by extending 2 s in a starting direction of the target video and extending 3 s in an ending direction of the target video. In some embodiments, the extended time may be set to other values. For example, a video clip having a duration of 10 s may be identified by extending 5 s in a starting direction of the target video and extending 5 s in an ending direction of the target video. If the edit operation is a double click, a video clip may be identified as having a duration between the time points of the two clicks.

Further, if the edit operation is a long press, a video clip may be identified from the target video as having a duration between the starting time and ending time of the long press. In some embodiments, in order to compensate for a delay in the user's press with respect to the playing of the video, the starting time of the long press may be determined by advancing it a couple of seconds, such that the identified video clip is a complete one. In some instances, when a functional button is pressed, the starting time of the press may be determined as 2 s before the press. In some embodiments, a starting time of the press may be determined as another time period before the press. Alternatively, a starting time of the press may be determined as a certain time period after the press.
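
The gesture rules above can be summarized in a short Python sketch; the durations used are the example values given in this disclosure (2 s/3 s extensions, 2 s press lead) and are assumed to be configurable.

    # Sketch of the gesture-to-clip rules; default durations are the example
    # values from this disclosure and are configurable in practice.
    def clip_from_gesture(gesture, t0, t1=None, pre=2.0, post=3.0, press_lead=2.0):
        if gesture == "single_click":
            # extend a predetermined duration in both directions around the click
            return (max(0.0, t0 - pre), t0 + post)
        if gesture == "double_click":
            # duration between the time points of the two clicks
            return (min(t0, t1), max(t0, t1))
        if gesture == "long_press":
            # advance the starting time to compensate for the user's reaction delay
            return (max(0.0, t0 - press_lead), t1)
        raise ValueError(f"unknown gesture: {gesture}")

    print(clip_from_gesture("single_click", 10.0))       # (8.0, 13.0)
    print(clip_from_gesture("long_press", 20.0, 26.5))   # (18.0, 26.5)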

When a video clip is selected, one or more of a location of capture, a time of capture and rating information may be generated. For example, a time tag may be generated for the selected video clip according to a time of capture thereof. The time tag may be generated automatically without requiring input by a user. In some instances, the time tag may provide a time in accordance with a clock present on a device used to capture or store the video clip. In some embodiments, a location tag may be generated for the selected video clip according to a location of capture thereof. The location tag may be generated automatically without requiring input by a user. For example, the location tag may be a geotag indicative of the location of a device used to capture the video clip at the time the video clip is captured. For example, an image capture device and/or aerial vehicle may have a GPS locator, or other type of locator, that may provide location information (e.g., global coordinates). In other instances, the location tag may be generated with user input. For instance, a user may specify the type of location (e.g., beach, home, forest) or provide any other personalized information about the location. The personalized information from the user may be combined with automatic location information. In some embodiments, a rating tag may be generated for the selected video clip according to a rating thereof. The rating tag may be automatically generated without user input. For instance, a default rating may be provided that may be modified by the user at a later time. In another example, a rating may be generated based on one or more video parameters (e.g., clarity, contrast, resolution, etc.). The rating tag may be provided based on an input from a user (a user may rate the video at the time the video is captured or at a later time). In some embodiments, any type of categorization tag may be generated that may help a user sort through videos at a later time. For instance, the tag may indicate the individuals or objects shown in the video (e.g., family, herd of buffalo). The tag may indicate the purpose of the video (e.g., personal use, for business). The tag may indicate a type of event that is depicted by the video (e.g., birthday celebration, concert, graduation). Other types of tags may be generated. The tag (e.g., time tag, location tag and rating tag) may be generated for a video clip as needed. In some examples, a rating may be generated when the video clip is generated. For example, a video clip may be assigned three stars, four stars, or another rating. Alternatively, a video clip may be assigned a default rating, and the user may modify the default rating later. The tag may alternatively be generated after the video clip is generated.
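
For illustration only, automatic tag generation might look like the following Python sketch; the clock and GPS sources, field names, and default rating are assumptions standing in for whatever the capture device actually provides.

    # Hedged sketch of automatic tag generation for a selected clip.
    from datetime import datetime, timezone

    def make_tags(gps_fix=None, user_rating=None, default_rating=3):
        tags = {"time": datetime.now(timezone.utc).isoformat()}  # time tag from the device clock
        if gps_fix is not None:
            tags["location"] = gps_fix            # e.g. (latitude, longitude) geotag
        tags["rating"] = user_rating if user_rating is not None else default_rating
        return tags

    print(make_tags(gps_fix=(22.54, 113.95), user_rating=4))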

Additionally, if an edit instruction is received, the tags, including the time tag, location tag and rating tag of the video clip, may be edited according to the edit instruction. The video clips can be stored in association with the corresponding tags in a video clip library.

In some embodiments, the preset edit rule may be one or a combination of an edit rule of performing editing according to a time tag of the video clips, an edit rule of performing editing according to a location tag of the video clips, and an edit rule of performing editing according to a rating tag of the video clips. The user customized edit rule may be a combination of certain rating, time and/or location which are set according to the user's demand. Additionally, the target video being played may be a video being recorded, a video being previewed or a video being played back.

At step S2-102, video clips specified by the tags indicated by the edit rule are acquired from a video clip library. In some examples, if the edit rule configured by a user is a preset edit rule, video clips having the tags indicated by the preset edit rule may be acquired from the video clip library. For instance, video clips having a time tag “today” may be acquired, video clips having a location tag “beach” may be acquired, or video clips having a rating tag “three star” may be acquired. Any other categories of tags may be applied to the video clips.

Alternatively, video clips having a combination of a time tag, location tag and rating tag may be acquired. For instance, video clips having a time tag “today” and a location tag “beach” may be acquired. Alternatively, the edit rule may be: acquiring video clips from one flight route of the aerial vehicle.

In some examples, if the edit rule configured by a user is a user customized edit rule, video clips may be acquired according to the tags in the user customized edit rule. For instance, a user customized edit rule may be: acquiring two video clips having a rating tag “five star”, one video clip having a location tag “forest” and one video clip having a time tag “today”. In this case, video clips having such tags may be acquired from the video clip library. The user customized edit rule may be created by simple edit operations.
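A minimal Python sketch of such tag-based acquisition is shown below; modeling the clip library as a list of dictionaries and the edit rule as a set of required tag values is an assumption made for illustration.

    # Sketch: acquire clips from a library whose tags match the edit rule.
    def acquire(library, **required_tags):
        return [clip for clip in library
                if all(clip["tags"].get(k) == v for k, v in required_tags.items())]

    library = [
        {"file": "clip_0001.mp4", "tags": {"time": "today", "location": "beach", "rating": 3}},
        {"file": "clip_0002.mp4", "tags": {"time": "today", "location": "forest", "rating": 5}},
    ]
    print(acquire(library, time="today", location="beach"))   # preset rule: today + beach
    print(acquire(library, rating=5))                          # customized rule: five star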

At step S2-103, the identified video clips are edited. In particular, after video clips are acquired according to the tags in S2-102, the video clips may be synthesized into a video file. The video clips may be combined to produce the video file. Alternatively, the acquired video clips may be inserted into a template to produce the video file. The template may be a default template from the system or a template selected by the user's instruction.

A template for use in generating an edited video file may include one or more of: a preset audio, a subtitle, and a transition clip. In some embodiments, prologue video clips, epilogue video clips, and/or transition effect clips may be provided to be merged with a video. If the template includes a transition clip, the video clips and the transition clips may be arranged according to the user's instructions. In some instances, the video clips may be inserted between the transition clips, such that the synthesizing of video clips is smoothed by the transition clips. For example, if two video clips having two different levels of brightness are merged/synthesized, a transition clip may be used to adjust the brightness between the clips. In additional examples where the template includes audio and/or subtitles, the video clips may be synthesized with the audio and/or subtitles to produce a video file. Further, a synthesized video file may be produced by selecting a template and inserting the video clips.
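
For illustration, assembling a playlist from such a template might be sketched as follows in Python; the template structure (prologue, transition, and epilogue keys) is an assumption rather than a defined file format.

    # Sketch: build an ordered playlist from a template - prologue, clips
    # interleaved with transition clips, then epilogue.
    def assemble(clips, template):
        playlist = [template["prologue"]] if template.get("prologue") else []
        transition = template.get("transition")
        for i, clip in enumerate(clips):
            if i > 0 and transition:
                playlist.append(transition)   # smooth the join between adjacent clips
            playlist.append(clip)
        if template.get("epilogue"):
            playlist.append(template["epilogue"])
        return playlist

    template = {"prologue": "intro.mp4", "transition": "fade.mp4", "epilogue": "outro.mp4"}
    print(assemble(["a.mp4", "b.mp4", "c.mp4"], template))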

In other examples, the user may edit the templates, such as modifying one or more of the audio, subtitles and transition clips. Additionally, the user may edit the video file if an edit instruction is received. In some instances, if the edit instruction is a reordering instruction, then the video clips may be reordered according to the user's specified order and synthesized into a video file. Alternatively, a “shaking” operation or a press on a “synthesize again” button may be performed to reorder the acquired video clips in a random manner. When the clips are reordered in a random manner, corresponding edits may or may not be reordered with the clips. For example, if certain subtitles were associated with certain clips, the subtitles may be reordered with the clips. Alternatively, the subtitles may remain playing in the same order and rate without regard to whether the video clips are reordered. If the edit instruction is a deleting instruction, the video clips specified by the deleting instruction may be deleted. If the edit instruction is an adding instruction, new video clips may be acquired from the video clip library according to the tags specified by the adding instruction, and added to a specified position of the video file. If the edit instruction is an instruction of acquiring new video clips, new video clips may be acquired from the video clip library according to the tags specified by that instruction, and a new video file may be synthesized accordingly.
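
The edit instructions described above could be dispatched as in the following illustrative Python sketch; the instruction names and data layout are assumptions chosen for readability, not a disclosed protocol.

    # Sketch: apply reorder, random "shake", delete, and add instructions to an
    # ordered list of clips.
    import random

    def apply_edit(clips, instruction):
        kind = instruction["type"]
        if kind == "reorder":                    # user-specified order
            return [clips[i] for i in instruction["order"]]
        if kind == "shake":                      # "synthesize again": random reorder
            return random.sample(clips, len(clips))
        if kind == "delete":
            return [c for c in clips if c not in instruction["clips"]]
        if kind == "add":
            out = list(clips)
            out.insert(instruction["position"], instruction["clip"])
            return out
        raise ValueError(kind)

    print(apply_edit(["a", "b", "c"], {"type": "reorder", "order": [2, 0, 1]}))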

FIG. 29 is a flowchart showing a video processing method according to another embodiment of the present disclosure. At step S2-201 of FIG. 29, an edit rule configured by a user is received. In an example, before receiving the edit rule configured by the user, one or more of the following operations may be performed: receiving a tagging edit operation for the target video being played; extracting video clips being selected by a tagging operation; generating tags for the selected video clips; and storing the video clips in association with the generated tags in a video clip library.

Video clips may be selected from the target video by a user interaction, such as a single click, a double click or a long press. After the video clips have been selected, the selected video clips may be extracted. When a video clip is selected, one or more of a time of capture, a location of capture and rating information may be generated, such that a time tag, location tag and rating tag may be available. The video clips are stored in association with the corresponding tags in a video clip library.

Additionally, the user may select a preset edit rule. Alternatively, the user may customize the edit rule according to a demand. For example, the preset edit rule may be one or a combination of an edit rule of performing editing according to a time tag of the video clips, an edit rule of performing editing according to a location tag of the video clips, and an edit rule of performing editing according to a rating tag of the video clips. In some examples, the user customized edit rule may be a combination of certain rating, time and/or location which are set according to the user's demand.

At step S2-202, based on tags indicated by the edit rule, video clips which are identified by the tags may be acquired from a library. In some examples, if the edit rule configured by a user is a preset edit rule, video clips having the tags indicated by the preset edit rule may be acquired from the video clip library. For instance, video clips having a certain time tag, location tag or rating tag may be acquired. Alternatively, video clips from one flight route of the aerial vehicle may be acquired. If the edit rule configured by a user is a user customized edit rule, video clips may be acquired according to the tags in the user customized edit rule.

At step S2-203, video clips may be synthesized into video samples. For example, after video clips are acquired according to the tags specified by the edit rule, the video clips may be synthesized into a plurality of video samples. Additionally, the video clips may be combined to produce the video samples. Alternatively, the acquired video clips may be inserted into a template to produce the video samples. The template may be a default template from the system or a template selected by the user's instruction. In some embodiments, the template may include one or more template features, which may include one or more of a preset audio, a subtitle and a transition clip.

In some examples, the video clips may be reordered. Further, some video clips may be deleted or added. Additionally, if the number of video clips which can be acquired from the video clip library according to the tags specified by the edit rule is larger than a predetermined value, the video clips may be grouped, and a plurality of video samples may be produced by synthesizing a certain number of video clips from each group. Alternatively, if the number of video clips which can be acquired from the video clip library according to the tags specified by the edit rule is smaller than the predetermined value, a certain number of video clips may be acquired to synthesize a plurality of video samples.

In an example, if 20 video clips are acquired according to a time tag “today”, a certain number of video clips, for example 10 clips or 5 clips, may be acquired from the 20 video clips to produce a plurality of video samples. In other instances, if 1,000 video clips are acquired according to a time tag “today”, these 1,000 video clips may be grouped into a number of groups, for example 10 groups each having 100 video clips. In this case, a certain number of video clips, for example 20 video clips, may be acquired from each group to synthesize a plurality of video samples. The number of groups and the number of video clips to be acquired from each group may be configured by the user.
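
One possible reading of this grouping rule is sketched below in Python; the threshold, group count, and per-group count are the example values above, and the exact grouping policy is an assumption made for illustration.

    # Sketch of the grouping rule: if more clips match than a threshold, split
    # them into groups and draw a fixed number from each group to build samples.
    def plan_samples(clips, threshold=100, group_count=10, per_group=20):
        if len(clips) <= threshold:
            return [clips[:per_group]]                      # a single pool of sample clips
        size = (len(clips) + group_count - 1) // group_count
        groups = [clips[i:i + size] for i in range(0, len(clips), size)]
        return [g[:per_group] for g in groups]              # clips drawn from each group

    print([len(g) for g in plan_samples(list(range(1000)))])   # ten groups of 20 clips each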

In step S2-204, video samples may be shared. In particular, the synthesized video samples may be uploaded to a sharing platform. In some instances, the synthesized video samples may be uploaded to a network such that viewers may browse them. Additionally, assessments of the video samples may be obtained during the sharing of the video samples at step S2-205. Viewers may comment on the video samples by providing assessments. Additionally, the number of clicks on a video sample and the duration of viewing a video sample may be recorded. The assessment information, such as the viewers' assessments, duration of viewing and number of clicks, may be acquired.

At step S2-206, video samples may be screened based on the assessments. For example, a target sample may be selected from the plurality of video samples based on the assessment information such as the viewers' assessments, duration of viewing and number of clicks. In some instances, the target sample may be stored in association with the assessments in the video clip library.
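
For illustration only, screening might combine the assessment signals into a single score as in the Python sketch below; the weighting of ratings, clicks, and viewing duration is an arbitrary placeholder rather than a disclosed formula.

    # Sketch: score each shared sample from viewer assessments, click counts,
    # and viewing duration, then keep the best one as the target sample.
    def screen(samples):
        def score(s):
            return 2.0 * s["avg_rating"] + 0.01 * s["clicks"] + 0.1 * s["view_seconds"]
        return max(samples, key=score)

    samples = [
        {"id": "sample_a", "avg_rating": 4.2, "clicks": 310, "view_seconds": 95},
        {"id": "sample_b", "avg_rating": 3.8, "clicks": 520, "view_seconds": 60},
    ]
    print(screen(samples)["id"])   # the selected target sample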

Additionally, at step S2-207, a determination may be made whether further synthesizing is to be performed. For example, if the number of acquired video clips is larger than or equal to the predetermined value before the video synthesizing, the video samples may be sub-samples which are synthesized based on video clips acquired from each group. Therefore, in this case, the video samples may be further synthesized to produce a final video file. Alternatively, if the number of acquired video clips is smaller than the predetermined value before the video synthesizing, the screened video sample may be kept as the final video file. In some instances, the final video file may remain on the sharing platform.

FIG. 30 is a schematic view showing a configuration of a video processing device according to an embodiment of the present disclosure. The device described in this disclosure may be, or may be installed on, a display apparatus of an electronic device, such as an aerial vehicle (e.g., UAV), a camera, a mobile phone, or a tablet.

The receiving module 5-1 may receive an edit rule configured by the user. The acquiring module 5-2 may acquire, from a video clip library, video clips which are specified by the tags in the edit rule. The editing module 5-3 may perform one or more edits on the acquired video clips.

The receiving module 5-1 may receive a preset edit rule or a user customized edit rule. In some embodiments, the preset edit rule may be one or a combination of an edit rule of performing editing according to a time tag of the video clips, an edit rule of performing editing according to a location tag of the video clips, and an edit rule of performing editing according to a rating tag of the video clips. In some embodiments, the user customized edit rule may be a combination of certain rating, time and/or location which are set according to the user's demand.

The acquiring module 5-2 may acquire, from a video clip library, video clips which are specified by the tags indicated by the preset edit rule or the user customized edit rule. For example, video clips may be acquired based on certain tags, such as a certain time tag, location tag or rating tag. For instance, the best video clip of the month or the best video clip of the year may be acquired based on viewers' assessments. In some examples, the video clips may have rating tags which may be used to assess whether the video clips qualify as the best video clip, and the video clips may have time tags which may be used to assess whether the video clips qualify as the best video clip in a specified period of time.

The editing module 5-3 may synthesize the acquired video clips and produce a video file. In some embodiments, the video clips may be combined to produce the video file. Alternatively, the acquired video clips may be inserted into a template to produce the video file. The template may be a default template from the system or a template selected by the user's instruction. In some embodiments, the user may edit the video file if an edit instruction is received.

FIG. 31 is a schematic view showing a configuration of a video processing device according to another embodiment of the present disclosure. In some embodiments, editing of a video and synthesizing of a video may occur on the same device. In some embodiments, the video processing device may further include a storing module 5-4, a collecting module 5-5 and a screening module 5-6.

The storing module 5-4 may receive video data from the aerial vehicle as a video source such that video clips may be acquired. In some instances, the storing module 5-4 may locally buffer the video data captured by the aerial vehicle and use the buffered video data as target video from which video clips may be extracted.

The collecting module 5-5 may select video clips from the target video and generate tags for the selected video clips.

The screening module 5-6 may extract video clips from a video clip library, synthesize the video clips to produce a plurality of video samples, share the synthesized video samples on a sharing platform such as a network, and screen the synthesized video samples based on the viewers' assessments in order to select a target sample.

FIG. 32 is a schematic view showing a configuration of the acquiring module in the video processing device according to the embodiment shown in FIG. 30 or FIG. 31. In some embodiments, the acquiring module 5-2 may include an acquiring unit 5-21 and a first modifying unit 5-22.

The acquiring unit 5-21 may acquire, from a video clip library, video clips which are specified by the tags in the edit rule. One or more tags may be associated with the acquired video clips. The tags may be acquired with the video clips or may be provided as part of the video clips.

The first modifying unit 5-22 may edit the tags of the video clips, including the time tag, location tag and/or rating tag, according to an edit instruction. Optionally, an edit instruction may be provided by a user interacting with an editing application on a device. The user may provide an input that is indicative of a change to a tag, and the corresponding tag may be updated.

FIG. 33 is a schematic view showing a configuration of the editing module in the video processing device according to the embodiment shown in FIG. 30 or FIG. 31. In some embodiments, the editing module 5-3 may include a synthesizing unit 5-31 and a second modifying unit 5-32.

The synthesizing unit 5-31 may synthesize the video clips acquired from the video clip library by the acquiring unit 5-21 to produce a video file. In some embodiments, the synthesizing unit 5-31 may combine the video clips to produce the video file. Alternatively, the synthesizing unit 5-31 may insert the acquired video clips into a template to produce the video file. The template may be a default template from the system or a template selected by the user's instruction.

The template may include one or more of a preset audio, a subtitle and a transition clip. If the template includes a transition clip, the video clips and the transition clips may be arranged according to the user's instruction. In some instances, the video clips may be inserted between the transition clips, such that the video clips are smoothed by the transition clips. If the template includes audio and/or subtitles, the video clips may be synthesized with the audio and/or subtitles to produce a video file. The synthesized video file may be browsed by viewers.

The second modifying unit 5-32 may receive an edit instruction for editing the video file and edit the video file according to the edit instruction. In some instances, if the edit instruction is a reordering instruction, then the video clips may be reordered according to the user's specified order and synthesized into a video file. Alternatively, a “shaking” operation or a press on a “synthesize again” button may be performed to reorder the acquired video clips in a random manner. If the edit instruction is a deleting instruction, the video clips specified by the deleting instruction may be deleted. If the edit instruction is an adding instruction, new video clips may be acquired from the video clip library according to the tags specified by the adding instruction, and added to a specified position of the video file. If the edit instruction is an instruction of acquiring new video clips, new video clips may be acquired from the video clip library according to the tags specified by that instruction, and a new video file may be synthesized accordingly. Additionally, the second modifying unit 5-32 may edit the templates, such as modifying one or more of the audio, subtitles and transition clips.

FIG. 34 is a schematic view showing a configuration of the collecting module in the video processing device according to the embodiment shown in FIG. 31. In some embodiments, the collecting module 5-5 may include a tagging unit 5-51 and a storing unit 5-52. The tagging unit 5-51 may receive a tagging operation for the target video being played, and extract video clips being selected by the tagging operation. The storing unit 5-52 may generate tags for the selected video clips, and store the video clips in association with the generated tags in a video clip library. In some embodiments, the storing unit 5-52 may perform one or more of the following operations: generating a time tag for the selected video clip according to a time of capture thereof, generating a location tag for the selected video clip according to a location of capture thereof, and generating a rating tag for the selected video clip according to a rating thereof.

FIG. 35 is a schematic view showing a configuration of the screening module in the video processing device according to the embodiment shown in FIG. 31. In some embodiments, the screening module 5-6 may include a selecting unit 5-61, a sample generating unit 5-62, a sharing unit 5-63, a tag acquiring unit 5-64 and a screening unit 5-65.

The selecting unit 5-61 may acquire a certain number of video clips based on tags of the video clips. For example, video clips having certain tags, such as a certain time tag, location tag or rating tag, may be acquired. Alternatively, video clips from one flight route of the aerial vehicle may be acquired.

Additionally, the sample generating unit 5-62 may synthesize the acquired video clips to produce the video samples. In some embodiments, if the number of video clips which can be acquired from the video clip library according to the tags specified by the edit rule is larger than a predetermined value, the video clips may be grouped, and a plurality of video samples may be produced by synthesizing a certain number of video clips from each group. Alternatively, if the number of video clips which can be acquired from the video clip library according to the tags specified by the edit rule is smaller than the predetermined value, a certain number of video clips may be acquired to synthesize a plurality of video samples. In some instances, the number of groups and the number of video clips to be acquired from each group may be configured by the user. In some instances, the predetermined value for determining whether the video clips should be grouped may also be configured by the user.

The sharing unit 5-63 may upload the video samples to a sharing platform. In some instances, the sharing unit 5-63 may upload the synthesized video samples to a network such that viewers may browse them.

The tag acquiring unit 5-64 may acquire the viewers' assessments. The viewers may comment on the video samples by providing assessments. In some embodiments, the number of clicks on a video sample and the duration of viewing a video sample may be recorded. The assessment information, such as the viewers' assessments, duration of viewing and number of clicks, may be acquired.

The screening unit 5-65 may select a target sample from the plurality of video samples based on the assessment information such as the viewers' assessments, duration of viewing and number of clicks. In this way, the best video clip of the month or the best video clip of the year may be acquired.

The screening module 5-6 may further include a sample storing unit 5-66. The sample storing unit 5-66 may store the target sample in association with the assessments in the video clip library.

FIG. 36 is a schematic view showing a configuration of a displaying device according to an embodiment of the present disclosure. The display device described in this disclosure may be a display apparatus of an aerial vehicle, a mobile phone, or a tablet. In some embodiments, the display device may include a displayer 5-100 and a processor 5-200.

The displayer 5-100 may display the video clips and operation information. The processor 5-200 may be configured to, before receiving the edit rule configured by the user, perform one or more of the following operations: receiving a tagging edit operation for the target video being played; extracting video clips being selected by a tagging operation; generating tags for the selected video clips; and storing the video clips in association with the generated tags in a video clip library.

In some embodiments, the processor 5-200 may be configured to generate tags for the selected video clips, and store the video clips in association with the generated tags in a video clip library. In some embodiments, the processor 5-200 may perform one or more of the following operations: generating a time tag for the selected video clip according to a time of capture thereof, generating a location tag for the selected video clip according to a location of capture thereof, and generating a rating tag for the selected video clip according to a rating thereof.

In some embodiments, the processor 5-200 may be configured to receive a preset edit rule or a user customized edit rule. In some instances, the preset edit rule may be one or a combination of an edit rule of performing editing according to a time tag of the video clips, an edit rule of performing editing according to a location tag of the video clips, and an edit rule of performing editing according to a rating tag of the video clips.

In some embodiments, the processor 5-200 may be configured to receive an edit instruction after storing the acquired video clips in association with the corresponding tags in a video clip library, and to edit the tags of the video clips, including the time tag, location tag and rating tag, according to the edit instruction.

The processor 5-200 may be configured to synthesize the acquired video clips to produce a video file. Additionally, the processor 5-200 may be configured to perform one or more of the following operations: reordering the video clips in the video file, deleting video clips from the video file, and adding new video clips into the video file. Further, the processor 5-200 may be configured to acquire a predetermined number of video clips from the video clip library based on tags of the video clips, synthesize the video clips to produce video samples, share the video samples, acquire assessments of the video samples, and screen the video samples to select a target sample according to the assessments.

The processor 5-200 may be configured to store the target sample in association with the assessments in the video clip library. Additionally, the processor 5-200 may be configured to acquire a template and insert the video clips into the template to produce the video file. The template may include one or more of: a preset audio, a subtitle and a transition clip. Therefore, the final video file may include video, audio and/or subtitles. Further, the processor 5-200 may be configured to receive a user's instruction to edit the templates, such as modifying one or more of the audio, subtitles and transition clips.

The present disclosure provides computer control systems that are programmed to implement methods of the disclosure. FIG. 37 shows a computer system 6-501 that is programmed or otherwise configured to process video data. The computer system 6-501 can regulate various aspects of processing video data of the present disclosure, such as, for example, capturing video data, storing video data, editing video data, and merging video data. The computer system 6-501 can be a processor of an image capture system, an electronic device of a user, or a computer system that is remotely located with respect to the electronic device. The electronic device can be a mobile electronic device.

The computer system 6-501 includes a central processing unit (CPU, also "processor" and "computer processor" herein) 6-505, which can be a single core or multi core processor, or a plurality of processors for parallel processing. The computer system 6-501 also includes memory or memory location 6-510 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 6-515 (e.g., hard disk), communication interface 6-520 (e.g., network adapter) for communicating with one or more other systems, and peripheral devices 6-525, such as cache, other memory, data storage and/or electronic display adapters. The memory 6-510, storage unit 6-515, interface 6-520 and peripheral devices 6-525 are in communication with the CPU 6-505 through a communication bus (solid lines), such as a motherboard. The storage unit 6-515 can be a data storage unit (or data repository) for storing data. The computer system 6-501 can be operatively coupled to a computer network ("network") 6-530 with the aid of the communication interface 6-520. The network 6-530 can be the Internet, an internet and/or extranet, or an intranet and/or extranet that is in communication with the Internet. The network 6-530 in some cases is a telecommunication and/or data network. The network 6-530 can include one or more computer servers, which can enable distributed computing, such as cloud computing. The network 6-530, in some cases with the aid of the computer system 6-501, can implement a peer-to-peer network, which may enable devices coupled to the computer system 6-501 to behave as a client or a server.

The CPU 6-505 can execute a sequence of machine-readable instructions, which can be embodied in a program or software. The instructions may be stored in a memory location, such as the memory 6-510. The instructions can be directed to the CPU 6-505, which can subsequently program or otherwise configure the CPU 6-505 to implement methods of the present disclosure. Examples of operations performed by the CPU 6-505 can include fetch, decode, execute, and writeback.

The CPU 6-505 can be part of a circuit, such as an integrated circuit. One or more other components of the system 6-501 can be included in the circuit. In some cases, the circuit is an application specific integrated circuit (ASIC).

The storage unit 6-515 can store files, such as drivers, libraries and saved programs. The storage unit 6-515 can store user data, e.g., user preferences and user programs. The computer system 6-501 in some cases can include one or more additional data storage units that are external to the computer system 6-501, such as located on a remote server that is in communication with the computer system 6-501 through an intranet or the Internet.

The computer system 6-501 can communicate with one or more remote computer systems through the network 6-530. For instance, the computer system 6-501 can communicate with a remote computer system of a user (e.g., a user terminal). Examples of remote computer systems include personal computers (e.g., portable PC), slate or tablet PC's (e.g., Apple® iPad, Samsung® Galaxy Tab), telephones, Smart phones (e.g., Apple® iPhone, Android-enabled device, Blackberry®), or personal digital assistants. The user can access the computer system 6-501 via the network 6-530.

Methods as described herein can be implemented by way of machine (e.g., computer processor) executable code stored on an electronic storage location of the computer system 6-501, such as, for example, on the memory 6-510 or electronic storage unit 6-515. The machine executable or machine readable code can be provided in the form of software. During use, the code can be executed by the processor 6-505. In some cases, the code can be retrieved from the storage unit 6-515 and stored on the memory 6-510 for ready access by the processor 6-505. In some situations, the electronic storage unit 6-515 can be precluded, and machine-executable instructions are stored on memory 6-510.

The code can be pre-compiled and configured for use with a machine having a processor adapted to execute the code, or can be compiled during runtime. The code can be supplied in a programming language that can be selected to enable the code to execute in a pre-compiled or as-compiled fashion.

Aspects of the systems and methods provided herein, such as the computer system 6-501, can be embodied in programming. Various aspects of the technology may be thought of as "products" or "articles of manufacture" typically in the form of machine (or processor) executable code and/or associated data that is carried on or embodied in a type of machine readable medium. Machine-executable code can be stored on an electronic storage unit, such as memory (e.g., read-only memory, random-access memory, flash memory) or a hard disk. "Storage" type media can include any or all of the tangible memory of the computers, processors or the like, or associated modules thereof, such as various semiconductor memories, tape drives, disk drives and the like, which may provide non-transitory storage at any time for the software programming. All or portions of the software may at times be communicated through the Internet or various other telecommunication networks. Such communications, for example, may enable loading of the software from one computer or processor into another, for example, from a management server or host computer into the computer platform of an application server. Thus, another type of media that may bear the software elements includes optical, electrical and electromagnetic waves, such as used across physical interfaces between local devices, through wired and optical landline networks and over various air-links. The physical elements that carry such waves, such as wired or wireless links, optical links or the like, also may be considered as media bearing the software. As used herein, unless restricted to non-transitory, tangible "storage" media, terms such as computer or machine "readable medium" refer to any medium that participates in providing instructions to a processor for execution.

Hence, a machine readable medium, such as computer-executable code, may take many forms, including but not limited to, a tangible storage medium, a carrier wave medium or physical transmission medium. Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the databases, etc. shown in the drawings. Volatile storage media include dynamic memory, such as the main memory of such a computer platform. Tangible transmission media include coaxial cables, copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier-wave transmission media may take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media therefore include, for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, punch cards, paper tape, any other physical storage medium with patterns of holes, a RAM, a ROM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer may read programming code and/or data. Many of these forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.

The computer system 6-501 can include or be in communication with an electronic display 6-535 that comprises a user interface (UI) 6-540 for providing, for example, a user terminal that plays video and/or allows a user to edit video at the user interface 6-540. Examples of UIs include, without limitation, a graphical user interface (GUI) and web-based user interface.

Methods and systems of the present disclosure can be implemented by way of one or more algorithms. An algorithm can be implemented by way of software upon execution by the central processing unit 6-505. The algorithm can, for example, prioritize processing of video clips or order video clips that are to be merged.

The systems, devices, and methods described herein can be applied to a wide variety of objects, including movable objects and stationary objects. As previously mentioned, any description herein of an aerial vehicle, such as a UAV, may apply to and be used for any movable object. Any description herein of an aerial vehicle may apply specifically to UAVs. A movable object of the present disclosure can be configured to move within any suitable environment, such as in air (e.g., a fixed-wing aircraft, a rotary-wing aircraft, or an aircraft having neither fixed wings nor rotary wings), in water (e.g., a ship or a submarine), on ground (e.g., a motor vehicle, such as a car, truck, bus, van, motorcycle, bicycle; a movable structure or frame such as a stick, fishing pole; or a train), under the ground (e.g., a subway), in space (e.g., a spaceplane, a satellite, or a probe), or any combination of these environments. The movable object can be a vehicle, such as a vehicle described elsewhere herein. In some embodiments, the movable object can be carried by a living subject, or take off from a living subject, such as a human or an animal. Suitable animals can include avines, canines, felines, equines, bovines, ovines, porcines, delphines, rodents, or insects.

The movable object may be capable of moving freely within the environment with respect to six degrees of freedom (e.g., three degrees of freedom in translation and three degrees of freedom in rotation). Alternatively, the movement of the movable object can be constrained with respect to one or more degrees of freedom, such as by a predetermined path, track, or orientation. The movement can be actuated by any suitable actuation mechanism, such as an engine or a motor. The actuation mechanism of the movable object can be powered by any suitable energy source, such as electrical energy, magnetic energy, solar energy, wind energy, gravitational energy, chemical energy, nuclear energy, or any suitable combination thereof. The movable object may be self-propelled via a propulsion system, as described elsewhere herein. The propulsion system may optionally run on an energy source, such as electrical energy, magnetic energy, solar energy, wind energy, gravitational energy, chemical energy, nuclear energy, or any suitable combination thereof. Alternatively, the movable object may be carried by a living being.

In some instances, the movable object can be an aerial vehicle. For example, aerial vehicles may be fixed-wing aircraft (e.g., airplanes, gliders), rotary-wing aircraft (e.g., helicopters, rotorcraft), aircraft having both fixed wings and rotary wings, or aircraft having neither (e.g., blimps, hot air balloons). An aerial vehicle can be self-propelled, such as self-propelled through the air. A self-propelled aerial vehicle can utilize a propulsion system, such as a propulsion system including one or more engines, motors, wheels, axles, magnets, rotors, propellers, blades, nozzles, or any suitable combination thereof. In some instances, the propulsion system can be used to enable the movable object to take off from a surface, land on a surface, maintain its current position and/or orientation (e.g., hover), change orientation, and/or change position.

The movable object can be controlled remotely by a user or controlled locally by an occupant within or on the movable object. The movable object may be controlled remotely via an occupant within a separate vehicle. In some embodiments, the movable object is an unmanned movable object, such as a UAV. An unmanned movable object, such as a UAV, may not have an occupant onboard the movable object. The movable object can be controlled by a human or an autonomous control system (e.g., a computer control system), or any suitable combination thereof. The movable object can be an autonomous or semi-autonomous robot, such as a robot configured with an artificial intelligence.

The movable object can have any suitable size and/or dimensions. In some embodiments, the movable object may be of a size and/or dimensions to have a human occupant within or on the vehicle. Alternatively, the movable object may be of size and/or dimensions smaller than that capable of having a human occupant within or on the vehicle. The movable object may be of a size and/or dimensions suitable for being lifted or carried by a human. Alternatively, the movable object may be larger than a size and/or dimensions suitable for being lifted or carried by a human. In some instances, the movable object may have a maximum dimension (e.g., length, width, height, diameter, diagonal) of less than or equal to about: 2 cm, 5 cm, 10 cm, 50 cm, 1 m, 2 m, 5 m, or 10 m. The maximum dimension may be greater than or equal to about: 2 cm, 5 cm, 10 cm, 50 cm, 1 m, 2 m, 5 m, or 10 m. For example, the distance between shafts of opposite rotors of the movable object may be less than or equal to about: 2 cm, 5 cm, 10 cm, 50 cm, 1 m, 2 m, 5 m, or 10 m. Alternatively, the distance between shafts of opposite rotors may be greater than or equal to about: 2 cm, 5 cm, 10 cm, 50 cm, 1 m, 2 m, 5 m, or 10 m.

In some embodiments, the movable object may have a volume of less than 100 cm×100 cm×100 cm, less than 50 cm×50 cm×30 cm, or less than 5 cm×5 cm×3 cm. The total volume of the movable object may be less than or equal to about: 1 cm³, 2 cm³, 5 cm³, 10 cm³, 20 cm³, 30 cm³, 40 cm³, 50 cm³, 60 cm³, 70 cm³, 80 cm³, 90 cm³, 100 cm³, 150 cm³, 200 cm³, 300 cm³, 500 cm³, 750 cm³, 1000 cm³, 5000 cm³, 10,000 cm³, 100,000 cm³, 1 m³, or 10 m³. Conversely, the total volume of the movable object may be greater than or equal to about: 1 cm³, 2 cm³, 5 cm³, 10 cm³, 20 cm³, 30 cm³, 40 cm³, 50 cm³, 60 cm³, 70 cm³, 80 cm³, 90 cm³, 100 cm³, 150 cm³, 200 cm³, 300 cm³, 500 cm³, 750 cm³, 1000 cm³, 5000 cm³, 10,000 cm³, 100,000 cm³, 1 m³, or 10 m³.

In some embodiments, the movable object may have a footprint (which may refer to the lateral cross-sectional area encompassed by the movable object) less than or equal to about: 32,000 cm², 20,000 cm², 10,000 cm², 1,000 cm², 500 cm², 100 cm², 50 cm², 10 cm², or 5 cm². Conversely, the footprint may be greater than or equal to about: 32,000 cm², 20,000 cm², 10,000 cm², 1,000 cm², 500 cm², 100 cm², 50 cm², 10 cm², or 5 cm².

In some instances, the movable object may weigh no more than 1000 kg. The weight of the movable object may be less than or equal to about: 1000 kg, 750 kg, 500 kg, 200 kg, 150 kg, 100 kg, 80 kg, 70 kg, 60 kg, 50 kg, 45 kg, 40 kg, 35 kg, 30 kg, 25 kg, 20 kg, 15 kg, 12 kg, 10 kg, 9 kg, 8 kg, 7 kg, 6 kg, 5 kg, 4 kg, 3 kg, 2 kg, 1 kg, 0.5 kg, 0.1 kg, 0.05 kg, or 0.01 kg. Conversely, the weight may be greater than or equal to about: 1000 kg, 750 kg, 500 kg, 200 kg, 150 kg, 100 kg, 80 kg, 70 kg, 60 kg, 50 kg, 45 kg, 40 kg, 35 kg, 30 kg, 25 kg, 20 kg, 15 kg, 12 kg, 10 kg, 9 kg, 8 kg, 7 kg, 6 kg, 5 kg, 4 kg, 3 kg, 2 kg, 1 kg, 0.5 kg, 0.1 kg, 0.05 kg, or 0.01 kg.

In some embodiments, a movable object may be small relative to a load carried by the movable object. The load may include a payload and/or a carrier, as described in further detail elsewhere herein. In some instances, a ratio of a movable object weight to a load weight may be greater than, less than, or equal to about 1:1. Optionally, a ratio of a carrier weight to a load weight may be greater than, less than, or equal to about 1:1. When desired, the ratio of a movable object weight to a load weight may be less than or equal to: 1:2, 1:3, 1:4, 1:5, 1:10, or even less. Conversely, the ratio of a movable object weight to a load weight can also be greater than or equal to: 2:1, 3:1, 4:1, 5:1, 10:1, or even greater.

In some embodiments, the movable object may have low energy consumption. For example, the movable object may use less than about: 5 W/h, 4 W/h, 3 W/h, 2 W/h, 1 W/h, or less. In some instances, a carrier of the movable object may have low energy consumption. For example, the carrier may use less than about: 5 W/h, 4 W/h, 3 W/h, 2 W/h, 1 W/h, or less. Optionally, a payload of the movable object may have low energy consumption, such as less than about: 5 W/h, 4 W/h, 3 W/h, 2 W/h, 1 W/h, or less.

While some embodiments of the present disclosure have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the disclosure. It should be understood that various alternatives to the embodiments of the disclosure described herein may be employed in practicing the disclosure. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

What is claimed is:
 1. A method of processing a video, said method comprising: capturing, by a movable object, a first set of video data at a first definition; transmitting, by the movable object, the first set of video data at a second definition wirelessly to a user terminal, wherein the first definition is higher than the second definition; receiving, by the movable object, a video edit request wirelessly from the user terminal, wherein the video edit request is formed from editing the received first set of video data at the second definition at the user terminal; and finding video corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition; wherein editing the received first set of video data at the second definition includes: selecting one or more video clips; and performing one or more of: changing an order of the one or more video clips; adding a video filtering effect to the one or more video clips; editing a duration of the one or more video clips; deleting the one or more video clips; adding another one or more video clips; presenting in a slow motion mode and presenting in a fast playing mode; and combining the one or more video clips into a single video file.
 2. The method of claim 1, wherein the third definition is lower than the first definition and higher than the second definition.
 3. The method of claim 1, wherein the first definition is at least a 4K definition and wherein the first set of video data is captured through a 4K or greater image capture device carried by the movable object.
 4. The method of claim 1, wherein the first, second, and third definitions are determined based on any one or more of a framerate, a visual quality, and a resolution.
 5. The method of claim 1, wherein the movable object is an unmanned aerial vehicle (UAV) carrying an image capture device, and wherein the first set of video data is captured while the UAV is in flight.
 6. The method of claim 5, wherein the second set of video data is formed while the UAV is in flight.
 7. The method of claim 1, wherein selecting the one or more video clips includes selecting the one or more video clips according to a video clip tag or a video clip description.
 8. The method of claim 1, wherein editing the first set of video data includes storing a video clip description of each of the one or more video clips in the video edit request, and wherein the video clip description includes a starting time and an ending time of the video clip among the first set of video data, or a rating of the video clip.
 9. The method of claim 1, wherein editing the first set of video data further includes editing audio data, and wherein the audio data is selected from a local audio source of an image capture device capturing the first set of video data.
 10. The method of claim 1, wherein editing the first set of video data further includes editing audio data, and wherein the audio data is selected from an online audio source.
 11. The method of claim 1, wherein editing the received first set of video data further includes accommodating the first set of video data at the second definition in a template, the template being selected from an online source and including an audio and a subtitle.
 12. The method of claim 1, wherein editing the received first set of video data further includes accommodating the first set of video data at the second definition in a template, the template being selected from an online source and including a prologue, an epilogue, and a transition effect.
 13. A movable object for processing a video, said movable object comprising: a communication unit that permits wireless communication with a user terminal; and one or more processors that individually or collectively: acquire a first set of video data at a first definition; transmit the first set of video data at a second definition wirelessly to a user terminal, wherein the first definition is higher than the second definition; receive a video edit request wirelessly from the user terminal, wherein the video edit request is formed from editing the received first set of video data at the second definition at the user terminal; and find video corresponding to edited video data described by the video edit request, thereby forming a second set of video data at a third definition; wherein editing the received first set of video data at the second definition includes: selecting one or more video clips; and performing one or more of: changing an order of the one or more video clips; adding a video filtering effect to the one or more video clips; editing a duration of the one or more video clips; deleting the one or more video clips; adding another one or more video clips; presenting in a slow motion mode and presenting in a fast playing mode; and combining the one or more video clips into a single video file.
 14. The movable object of claim 13, wherein the third definition is lower than the first definition and higher than the second definition.
 15. The movable object of claim 13, wherein editing the received first set of video data further includes accommodating the first set of video data at the second definition in a template, the template being selected from an online source and including an audio and a subtitle.
 16. The movable object of claim 13, wherein editing the received first set of video data further includes accommodating the first set of video data at the second definition in a template, the template being selected from an online source and including a prologue, an epilogue, and a transition effect.
 17. The movable object of claim 13, wherein the movable object is an unmanned aerial vehicle (UAV) carrying an image capture device, and wherein the first set of video data is captured while the UAV is in flight.
 18. A method of processing a video, said method comprising: capturing, by a movable object, a first set of video data at a first definition; transmitting, by the movable object, the first set of video data at a second definition wirelessly to a user terminal, wherein the first definition is higher than the second definition; receiving, by the movable object, a video edit request wirelessly from the user terminal, wherein the video edit request is formed from editing the received first set of video data at the second definition at the user terminal, and the video edit request includes a route that the movable object travels while capturing target video data, the target video data corresponding to edited video data described by the video edit request; and finding the target video data, thereby forming a second set of video data at a third definition, including finding the target video data according to the route.
 19. The method of claim 18, wherein editing the received first set of video data includes adding a video clip to the first set of video data as at least one of a prologue, an epilogue, or a transition effect, the video edit request including a tag of the video clip; the method further comprising: obtaining, based on the tag of the video clip, the video clip from a local memory of the movable object; and forming the second set of video data at the third definition by merging the video clip to the target video data as at least one of the prologue, the epilogue, or the transition effect.
 20. The method of claim 18, further comprising: capturing, by the movable object, a video at the first definition, the video including the first set of video data; and in response to determining that the video edit request indicates adding an external video clip as a transition effect: suspending capturing the video, receiving the external video clip, and merging the external video clip into the target video data as the transition effect; and after completion of receiving and merging the external video clip as the transition effect, resuming capturing the video.